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RECOMBINANT RTOONUCLEASE PROTEINS 

FIELD OF THE INVENTION 

This invention relates to the production of 
ribonuclease molecules which are toxic to cells of interest. 

BACKGROUND OF THE INVENTION 
15 Ribonucleases such as ribonuclease A ( "RNase A") 

and their cytotoxicity toward tumor cells are well documented 
from studies performed in the 1960s and 1970s and reviewed in 
Roth, J., 1963, Cancer Res. 23:657-666. Human serum was also 
discovered to contain several RNases (Reddi, E., 1975, 
20 Bxochem. Biophys . Res. Commun. 67:110-118, Blank et al . , Human 
body fluid ribonucleases: detection, interrelationships and 
significance 1-203-209 (IRL Press, London, 1981)) that are 
expressed in a tissue specific manner. The proteins involved 
in the host^'def ense^actfivitiT 6f the eosinophil are homologous 
25 to RNases and express RNase activity (Gleich et al . . 1986, 
Proc. Natl. Acad. Scl . , USA 83:3146-3150; Slifman et al . . 
1986, J. Immunol, 137:2913-2917). Thus, htunan serum RlSTases 
were believed to also have host defense activities. 

Further to these early studies was the discovery 
3 0 that an anti- tumor protein from oocytes of Rana piplens has 
homology to RNase A (Ardelt et al.. I99i, j. Biol. Chem. 
256:245-251) , This protein has been termed ONCONASE®, 
Alfacell Corporation, N.J. See also e.g., Darzynkiewicz et 
al. (1988) Cell Tissue Kinet. 21, 169-182, Mikulski et al . 
35 (1990) Cell Tissue Kinet. 23, 237-246. This protein is also 
described in U.S. Patent No. 4,888,172. Phase I and Phase 
I/II clinical trials of ONCONASE® as a single therapeutic 
agent in patients with a variety of solid tumors (Mikulski 
et al. (1993) Int. J. of Oncology 3, 57-64) or combined with 
tamoxifen in patients with advanced pancreatic carcinoma have 
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recently been completed (Chun et al. (1995) Proc Amer Soc Clin 
Oncol 14 No. 157, 210) . Conjugation of ONCONASE® to cell- 
type- specif ic ligands increased its potency towards tumor 
cells (Rybak et al. (1993) Drug Delivery 1, 3-10). Taken 
5 together, these results indicate that ONCONASE® has properties 
that are advantageous for the generation of a potent selective 
cell killing agent. 

However, since this is not a human -derived protein, 
it is prone to stimulating undesirable immune responses when 

10 used in humans. Thus, it would be desirable to retain the 
potent cytotoxic properties of this molecule while reducing 
its immunogenicity in humans. Further, it would be desirable 
to produce derivations of this molecule recombinantly so that 
it may be better chemically conjugated or recombinantly joined 

15 to other molecules for targeting to specific cells. Until the 
invention described herein, it has proven difficult to 
recombinantly express an active cytotoxic molecule related to 
ONCONASE®. Though it was thought that the methionine -glutamic 
acid amino terminal end of the recombinant molecule prohibited 

20 the molecule from having significant enzymatic activity, a 

means to solve this problem has not been forthcoming until the 
invention herein. 

Further, although advances in protein design 
techniques promise to alleviate some of the immunogenicity 

25 associated with the antibody portion of immunotoxins (Bird et 
al., 1988, Science 242:423; Huston et al , , 1988, Proc Natl 
Acad Sci USA 85:5879; Ward et al , , 1989, Nature 341:544), no 
solution has been forthcoming for the immunogenicity of the 
toxin portion other than immunosuppression of the patients 

30 (Khazaeli et al 1988, Proceedincrs of AACR 29:418). Thus, 
there is a continuing need for methods and compositions that 
would reduce the immunogenicity of the Rana. pipiens- derived 
toxic moiety. 

Non- cytotoxic human members of the RNase A 

35 superfamily linked to tumor associated antigens by chemical 
(Rybak et al. (1991) j. Biol. Chem 266, 1202-21207, Newton 
et al. (1992) J. Biol. Chem. 267, 19572-19578) or recombinant 
means (Rybak et al , Proc. Natl, Acad, Sci. U.S.A. 89, 3165, 
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Newton et al . (1994) J Biol Chem. 269, 26739-26745 have been 
shown to offer a strategy for selectively killing tiomor cells 
with less innnunogenicity than current strategies employing 
plant and bacterial toxins Rybak, S.M. & Youle, R.J. (I99i) 
5 Inrnamol. and Allergy Clizij.cs of North America. 11:2, 359-380. 
Human -derived ribonucleases of interest include eosinophil - 
derived neurotoxin (EDN) and angiogenin. 

SUMMARY OF THE INVENTION 
^° ^ave discovered how to construct RNases which are 

highly cytotoxic and which are modifications of the native 
ONCONASE® (nOnc) . When the nOnc was expressed recombinantly 
it was not found to have significant cytotoxicity. Our 
modified versions (rOnc) , however, are highly cytotoxic and 
15 otherwise retain the advantages of the native ONCONASE® 
molecules, while in some cases they also have increased 
cytotoxic properties. The rOnc molecules may be used alone or 
conveniently used to form chemical conjugates, as well as to 
form targeted recombinant immunofusions . These rOnc molecules 
20 can be used to decrease tumor cell growth. An effective - 
recombinant form of nOnc advantageously permits the 

recombinant-mol-ecure"to" ble^f used to "other "therapeut ic~or 

targeting molecules of interest recombinantly. Further, the 
rOnc molecule can be modified to enhance cytotoxicity as will 
25 be seen below. Our nOnc-derived molecules are also desirable 
because nOnc is a unique ribonuclease in that it can be 
administered alone directly to patients to decrease and 
inhibit tumor cell growth without the use of a targeting 
agent . 

The present invention also includes methods of 
selectively killing cells using a rOnc joined to a ligand to 
create a selective cytotoxic reagent of the present invention. 
The method coii^rises contacting the cells to be killed with a 
cytotoxic reagent of the present invention having a ligand 
35 binding moiety that specifically delivers the reagent to the 
cells to be killed. This method of the present invention may 
be used for cell separation in vitro by selectively killing 
unwanted types of cells, for example, in bone marrow prior to 
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transplantation into a patient undergoing marrow ablation by 
radiation, or for killing leukemia cells or T- cells that would 
cause graft -versus -host disease. The toxins can also be used 
to selectively kill unwanted cells in culture. 

Humanized versions of our rOnc molecules are also 
described which graft portions of mammalian or human- derived 
RNases such as angiogenin or hximan eosinophil derived 
neurotoxin (EDN) to the rOnc-derived molecules. A preferred 
embodiment of the invention is a molecule where the amino 
terminal end of EDN is placed onto the amino terminal end of 
the rone molecules. The surprising properties of these hybrid 
proteins with regard to ribonuclease activity and in vitro 
anti- tumor effects are described. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

FIGX7RE LEGENDS 
Figure 1 shows the deduced amino acid sequence of 
the Rana clone 9 (SEQ ID NO:2) . described below and sequence 
alignment with the amino acid sequence of nOnc (SEQ ID NO:i) . 
The bold print indicates identical residues between nOnc and 
Rana clone 9. The dots indicate missing amino acids in the 
PGR clone. 

^5 Figures 2A and 2B show the configuration of the DNA 

constructs exenplified in the examples. The PCR product 
obtained from J?ana pipiens DNA is identified as Rana 9. The 
N-and C-termini are synthetically filled in and identified as 
One in the constructs encoding [Met- ( -i) ] rOnc or EDN for the 

30 N- terminal EDN/Onc hybrid. Corresponding amino acid residues 
are indicated below each construct. Figure 2B shows the 
sequence alignment of the N- terminal sequences of nOnc (SEQ ID 
N0:3). rEDN (SEQ ID N0:4), [Met - ( - 1) ] rOnc containing a Gly (G) 
instead of Asp in position 20 (SEQ ID NO:5), rEDNjj^.jx) ^Oiic 

35 with an Asp in amino acid position 26 (SEQ ID NO:6) and 

rEDN(j^.2i)rOncG26 with a Giy in position 26 (SEQ ID NO:7). 
Bold letters indicate conserved residues, capital letters show 
the sequence deduced from Rana clone 9 . 
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Figrures 3A-3D show the inhibition of protein 
synthesis in human tumor cells by nOnc, rEDN, [Met- ( - 1) ] rOnc 
or hybrid proteins. Cells (10^) were plated in individual 
96 -well microtiter culture plates and treated with varying 
concentrations of each agent for 4 8 h. Cell viability was 
determined as described in the Example Section below. Result 
from more than one individual experiment were combined to 
yield the mean data points. Standard errors of the means, 
when they are greater than the symbol, are shown. Cell lines 
ACHN, renal cancer (Fig.3A); MDA-MB-231 (Fig. 3B) and HS 578T 
(Fig. 3D), breast cancer; SF-539 (Fig. 3C) , CNS, cancer. EDN 
(open triangles); nOnc (open squares); [Met- ( - 1) ] rOnc, (solid 
triangles); rEDN^^^.j^^j rOnc , (open circles); 
rEDN^j^. 21) ^OncG26 (solid circles) . 

Figure 4 shows a sequence alignment of some members 
of the RNase A superfamily: Frog lectin is from Rana 
catesJbeiana, ONCONASE®, EDN, ECP (human eosinophil cationic 
protein) , Ang is bovine angiogenin, Seminal is bovine seminal 
RNase, and RNase A is bovine pancreatic RNase A (SEQ ID NOs:8, 
1 and 9-13, respectively). Amino acids conserved in all ' 
members- are capitalized, " and" active site" residues~Hl2^r ^1 , 
and H119 (RNase A numbering) are marked with an asterisk. 

Figure 5 shows the inhibition of protein synthesis 
by MetSerOnc and MetSer- or MetGlu-OncFvs . The cytotoxic 
effect of the single chain antibody rOnc fusion proteins; 
E6FB[Met- ('l)]SerrOnc (closed circles) , [Met - ( - 1) ] SerrOnc- 
AngFBE€ ( open squares ) and [Met- ( -l) ] GlurOncFBE6 (closed 
squares) were compared to the non- targeted recombinant 
protein, [Met- ( -1) ] SerrOnc (open circles), by determining 
inhibition of protein synthesis in SF 539 cells. Cells were 
plated into 96 -well microtiter plates in Dulbecco's minimum 
essential medium supplemented with 10% heat- inactivated fetal 
bovine serum. Additions were made in a total volume of 10 /il 
and the plates were incxibated at 37« for 3 days. Phosphate 
buffered saline containing 0.1 mCi of [^^C] leucine was added 
for 2-4 h and the cells were harvested onto glass fiber 
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filters using a PHD cell harvester, washed with water, dried 
with ethanol and counted. The results were expressed as 
per cent of [^^C] leucine incorporation in the mock- treated 
wells. 

5 

Figures 6A and €B show inhibition of protein 
synthesis in an assay as described for Figures 3A-3D using 
cell line SF539, human glioma cells and rOnc fusion proteins 
designated MetLysTryrOnc (open circles. Fig. 6A) ; 
10 MetAlaAlaTyrOnc (closed circles. Fig. 6A) ; and rOnc fusion 

proteins with signal peptides, MetKDELSerrOnc • (open circles, 
Fig. 6B) and MetNLSSerrOnc (closed circles Fig. 6B) . 

Plffure 7 shows inhibition of protein synthesis in an 
15 assay as described for Figures 3A-3D using cell line SF539, 
human glioma cells and comparing three fusion proteins 
corresponding to MetSerOnc (SEQ ID NO: 39 with a Met-Ser amino 
terminal end): MetSerOnc (closed circles), MetSer0ncC4 
(MetSerOnc with a Cys at amino acid position 5 of SEQ ID 
20 NO: 39, closed squares) and MetSerOncC72 (MetSerOnc with a Cys 
at amino acid position 73 of SEQ ID N0:39, open circles). 

DETAILED DESCRIPTION 
This invention provides highly active and cytotoxic 
25 ribonuclease molecules which can be used to selectively kill 
and target cells, particularly tumor cells. In some 
embodiments the molecules are designed to incorporate 
sequences from human derived ribonucleases which are also 
highly active and cytotoxic, but which have the further 
advantage in that they are less immunogenic in hiomans . The 
rOnc molecules of the present invention are those which are 
recombinant nOnc- derived sequences. 

The nOnc molecule has an cunino acid sequence set 
forth in SEQ ID N0:1. Bovine pancreatic RNase A has an amino 
35 acid sequence set forth in SEQ ID NO: 13. Unless otherwise 

indicated, the amino acid sequence positions described herein 
use as a frame of reference the bovine pancreatic RNase A 
sequence in SEQ ID NO: 13 as this is the reference sequence 
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commonly used in the RNase field. It should be understood 
that such position designations do not indicate the number of 
amino acids in the claimed molecule per se, but indicate where 
in the claimed molecule the residue occurs when the claimed 
molecule sequence is aligned with bovine RNase. 

The rOnc molecules described and claimed herein will 
preferably have cysteine residues at amino acid positions 
corresponding to amino acid positions 26, 40, 58, 84, 95 and 
110; a lysine at position 41 and a histidine at position 119 
with reference to the bovine RNase A, SEQ ID NO: 13 (such 
positions correspond to amino acid position numbers 19, 30, 
48, 68, 75 and 90 and 87 and 104 of the nOnc sequence 
respectively set out in SEQ ID NO:l). 

The rOnc molecules of this invention are those that 
15 have measurable ribonuclease activity, as defined below. The 
ribonucleases will also have (a) an amino terminal end 
beginning with a methionine which is followed by any amino" 
acid other than glutamic acid (Glu) ,- (b) a cysteine at amino 
acid positions 26, 40, 58, 84, 95 and 110; a lysine at 
position 41 and a histidine at position 119, such positions 
being determined with reference to those in the amino acid _ 
seqlience" of' bovdne "imas^e and (c) an 

nOnc- derived amino acid sequence. 

Preferably, the rOnc molecules will have an amino 
25 terminal end selected from the group consisting of: 

Met -Ala; 

Met-Ala-Ala-Ser; 
Met-Arg; 
Met- (J) ; 
3 0 Met-Lys- (J) ; 

Met-Arg- (J) ; 
Met-Lys; 
Met-Lys -Pro; 

Met-Lys- (J) -Pro (SEQ ID NO: 14) ; 
35 Met-Lys -Pro- (J) (SEQ ID NO:15) ; 

Met-Asn; 
Met -Gin; 
Met-Asn- (J) ; 
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Met -Gin- (J) ; 

Met-Asn- (J) -Pro (SEQ id N0:16); 
Met- (J) -Lys; 

Met- (J) -Lys-Pro (SEQ ID NO: 17); and 
5 Met- (J) -Pro- Lys (SEQ ID NO:18); 

where (J) is Ser, Tyr or Thr. 

Further, it is preferred that the rOnc molecules be 
modified so that the aspartic acid of amino acid position 2 of 
nOnc (position 4 with reference to the sequence of bovine 
10 RNase A) is deleted or replaced by Ala or Asn. 

In alternative forms of the rOnc molecules, the 
molecules will employ an amino terminal end encoded by a 
sequence derived from the amino terminal end of EDN followed 
by a sequence from rOnc. In such forms, it is preferred that 
the amino acid sequence is one selected from the group 
consisting of those sequences substantially identical to those 
of a formula: 

wherein Met(-l) refers to an amino terminal residue 
of Met; wherein EDN^^^.j^^ refers to a contiguous sequence of 
amino acids of a length beginning at amino acid position l of 
EDN (SEQ ID NO: 9) and continuing to and including amino acid 
position "m» of EDN; wherein Onc^^.^Q^^ refers to a sequence of 
contiguous amino acids beginning at amino acid position "n" 
and continuing to and including amino acid position 104 as set 
out in SEQ ID NO:l; such that: 

when m is 21, n is 16 or 17; 

when m is 22, n is 17 

when m is 20, n is 16; 

when m is 19, n is 15 

when m is 18, n is 14; 

when m is 17, n is 12 or 13; 

when m is 16, n is 11, 12, 13 or 14; 

when m is 15, n is 10; 

when m is 14, n is 9; 

when m is 13, n is 8; and 

when m is 5 , nisi. 
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In other alternative embodiments, the rOnc molecule 
will be fused at the carboxyl end to a sequence from 
angiogenin, such as the sequence exemplified in SEQ ID NO:ii 
or that at amino acid positions 101 to 107 of SEQ ID NO:20. 
The nucleic acid sequence for human angiogenin is knovm and is 
set out in U.S. Patent Application No. 08/125,462. 

Preferred rOnc nucleic acid sequences are those that 
encode preferred rOnc amino acid sequences which are 
substantially identical to those in SEQ ID NOs:20, 22, 24, 26, 
2 8 and 30 (corresponding nucleic acid sequences are set out in 
SEQ ID N0s:19, 21. 23, 25, 27 and 29, respectively). Most 
preferred rOnc amino acid sequences are those that are 
substantially identical to those set forth in SEQ ID NOs:20, 
22, 24 and 26. Their corresponding nucleic acid sequences are 
also preferred and are set out in SEQ ID N0s:19, 21, 23 and 
25, including conservatively modified variants thereof. -The 
most preferred sequence includes SEQ ID N0:22, one which 
en^loys an amino terminal end comprising 1 to 21 (typically 
21) amino acids of the amino terminal end of EDN grafted' on to 
16 to 104 amino acids of the nOnc sequence, with amino acid 
r esid ue 20 in nOnc (Asp) being replaced with Gly. Preferred 
rOnc sequences further will contain optionally a Cys at a 
position corresponding to amino acid position 5, or 73 or Ala 
at amino acid position 88 in place of Cys with reference to 
SEQ ID NO: 39. 

Comparisons of the rOnc sequences provided here can 
be made to described sequences in the pancreatic RNase A 
superfamily. Many of such members are known and include, but 
are not limited to, frog lectin from Rana cazeabelana (Titani 
etal., Biochemistzy 26:2189 (1987)); ONCONASE® (Ardelt, W. 
et al., J. Biol. Chem. 266:245 (1991)); eosinophil derived 
neurotoxin (EDN) (Rosenberg et al . , supra); human eosinophil 
cationic protein (ECP) (Rosenberg et al . , J. Exp. Med. 170:163 
(1989)); angiogenin (Ang) (Fett, J.w. etal.. Biochemistry 
24:5480 (1985)); bovine seminal RNase (Preuss et al . , Nuc. 
Acids. Res. 18:1057 (1990)); and bovine pancreatic RNase 
(Beintama et al . , Prog. Biophys. Mol. Biol. 51:165 (1988)), 
references for all such proteins are incorporated by reference 
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herein, Amino acid sequence alignment for such RNases are 
also set out in Pig. 4 and in Youle et ai . , Crit. Rev. Ther. 
Drug. Carrier Systems 10:1-28 (1993) and in U.S. Patent 
Application Serial No, 08/125,462, which is incorporated by 
reference herein. 
Definitions . 

Unless defined otherwise herein, all technical and 
scientific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this 
invention belongs. Singleton et al , (1994) Dxctzionary of 
Microbiology and Molecular Biology, second edition, John Wiley 
and Sons (New York), and Hale and Marham (1991) The Harper 
Collins Dictionary of Biology, Harper Perennial, NY provide 
one of skill with a general dictionary of many of the terms 
used in this invention. Although any methods and materials 
similar or equivalent to those described herein can be used in 
the practice or testing of the present invention, the 
preferred methods and materials are described. For purposes 
of the present invention, the following terms are defined 
below. 

Amino acids may be referred to herein by either 
their commonly known three letter symbols or by the one- letter 
symbols recommended by the lUPAC-IUB Biochemical Nomenclature 
Commission. Nucleotides, likewise, may be referred to by 
their commonly accepted single- letter codes. 

The terms "measurable ribonuclease activity" or 
"significant ribonuclease activity" refer to a molecule which 
has an IC5Q (ng/ml) of less than 40 when added to a rabbit 
reticulocyte lysate assay wherein protein synthesis is 
inhibited as measured by the incorporation of [^^S] methionine 
into acid precipitable protein. IC50 is the concentration of 
protein necessary to inhibit protein synthesis by 50% in the 
assay. The lysate assay may be done as described in the 
Promega lysate assay kit which is commercially available from 
Promega Corporation, Madison, WI. Ribonuclease activity using 
high molecular weight RNA and tRNA is determined at 370C 
through the formation of perchloric acid soluble nucleotides 
following published protocols (Newton, D.L., et al . (1996) 
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Biochemistry 35:545-553) . With poly (A, C) UpG and poly U, 
ribonuclease activity is assayed according to DePrisco et al . , 
and Libonati and Floridi (DePrisco, R. , et al . (1984) 
Biochimica. et Biophysica Acta. 788:356-363; Libonati, M. et al . 
(1969) European J. Biochem. 8:81-87). Activity is assayed by 
measuring the increase with time in absorbance at 260 nm. 
Incubation mixtures (1 ml of 10 mM imidazole, 0.1 M NaCl , pH 
6.5 or pH 7) contain substrate and appropriate amounts of 
enzyme solution at 25°C. The in vitro translation assay (St. 
Clair, D.K., et al. (1987) Proc. Natl. Acad. Sci. 84:8330- 
8334) and the cell viability assays using the (3-[4,5- 
Dimethylthiazol-2-yl] -2 , 5-diphenyltetrazolium bromide; 
Thiazolyl blue) (MNTT) (Mossman, T. (1983) J. Immunol. Methods 
65:55-63) are performed as previously described (Pearson, 
J.W., et al. (1991) J". Natl. Cancer Inst. 83:1386-1391). 

An "none -derived" amino acid sequence is one that 
contains at least one string of six contiguous amino acids 
which is identical to a contiguous sequence of six amino acids 
selected from the group of sequences beginning at amino acid 
positions 1 (with Glu replacing pyroGlu) , 2, 3, 4, 5, 6, 7, 8, 

_ i" ^1 _ ^ ^ '_ ^ L 20 , 23, 24^ 25,26,27, 

28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 41, 42, 43.~ 44. 45. 
46, 47, 50, 52, 54, 56, 59, 60, 61, 62, 63, 64, 65, 66, 67. 
68, 69, 70, 71, 72, 73. 74, 76, 80, 81, 82, 84, 85, 86, 87, 
91, 92, 93, 95, or 96 of the nOnc amino acid sequence (SEQ ID 
NO:l) . 

"Conservatively modified variations" of a particular 
nucleic acid sequence refer to those nucleic acids which 
encode identical or essentially identical amino acid 
sequences, or where the nucleic acid does not encode an amino 
acid sequence, to essentially identical sequences. Because of 
the degeneracy of the genetic code, a large number of 
functionally identical nucleic acids encode any given 
polypeptide. For instance, the codons GCA, GCC, GCG and GCU 
all encode the amino acid alanine. Thus, at every position 
where an alanine is specified by a codon, the codon can be 
altered to any of the corresponding codons described without 
altering the encoded polypeptide. Such nucleic acid 
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variations are "silent variations," which are one species of 
conservatively modified variations. Every nucleic acid 
sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of 
skill will recognize that each codon in a nucleic acid (except 
AUG, which is ordinarily the only codon for methionine) can be 
modified to yield a functionally identical molecule. 
Accordingly, each silent variation of a nucleic acid which 
encodes a polypeptide is implicit in each described sequence. 
Furthermore, one of skill will recognize that individual 
substitutions, deletions or additions which alter, add or 
delete a single amino acid or a small percentage of amino 
acids in an encoded sequence are "conservatively modified 
variations** where the alterations result in the substitution 
of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally 
similar aunino acids are well known in the art. The following 
six groups each contain amino acids that are conservative 
substitutions for one another: 

1) Alanine (A) , Serine (S) , Threonine (T) ; 

2) Aspartic acid (D) , Glutamic acid (E) ; 

3) Asparagine (N) , Glutamine (Q) ; 

4) Arginine (R) , Lysine (K) ; 

5) Isoleucine (I) , Leucine (L) , Methionine (M) , Valine (V) ; 
and 

6) Phenylalanine (F) , Tyrosine (Y) , Tryptophan (W) . 

See also, Creighton (1984) Proteins W.H, Freeman and Company. 

The terms "isolated" or "biologically pure" refer to 
material which is substantially or essentially free from 
components which normally accompany it as found in its 
naturally occurring environment. The isolated material 
optionally comprises material not found with the material in 
its natural environment. The rOncs described herein are 
isolated and biologically pure since they are recombinantly 
produced in the absence of unrelated J^ana pipiens proteins. 
They may, however, include heterologous cell components, a 
ligand binding moiety, a label and the like. 
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The term "nucleic acid" refers to a 
deoxyribonucleotide or ribonucleotide polymer in either 
single- or double -stranded form, and unless otherwise limited, 
encompasses known analogues of natural nucleotides that 
hybridize to nucleic acids in a manner similar to naturally 
occurring nucleotides. Unless otherwise indicated, a 
particular nucleic acid sequence includes the complementary ' ■ 
sequence thereof. A nucleic acid encodes another nucleic acid 
where it is the same as the specified nucleic acid, or 
complementary to the specified nucleic acid. 

An "expression vector" includes a recombinant 
expression cassette which includes a nucleic acid which 
encodes a rOnc polypeptide which can be transcribed and 
translated by a cell. a recombinant expression cassette is a 
nucleic acid construct, generated recombinantly or 
synthetically, with a series of specified nucleic acid 
elements which permit transcription of a particular nuclei'c 
acid in a target cell. The expression vector can be part- of a 
plasmid, virus, or nucleic acid fragment. Typically, the 
recombinant expression cassette portion of the expression • 
vector includes a nucleic acid to be transcribed, and a 
"promoter^ 



The term "recombinant" when used with reference to a 
protein indicates that a cell expresses a peptide or protein 
encoded by a nucleic acid whose origin is exogenous to the 
cell. Recombinant cells can express genes that are not found 
within the native (non- recombinant ) form of the cell. 
Recombinant cells can also eaqjress genes found in the native 
form of the cell wherein the genes are re- introduced into the 
cell by artificial means, for example under the control of a 
heterologous promoter. 

The term "subsequence" in the context of a 
particular nucleic acid or polypeptide sequence refers to a 
region of the nucleic acid or polypeptide equal to or smaller 
than the particular nucleic acid or polypeptide. 

"Stringent hybridization wash conditions" in the 
context of nucleic acid hybridization experiments such as 
Southern and northern hybridizations are sequence dependent. 
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and are different under different environmental parameters. 
An extensive guide to the hybridization of nucleic acids is 
found in Tijssen (1993) Lahoratory Techniques in Biochemistry 
and Molecular Biology- -Hybridization with Nucleic Acid Probes 
Part I, Chapter 2 "Overview of principles of hybridization and 
the strategy of nucleic acid probe assays", Elsevier, New 
York. Generally, highly stringent wash conditions are 
selected to be about 5° C lower than the thermal melting point 
<'^m) the specific sequence at a defined ionic strength and 

ph. The is the temperature (under defined ionic strength 
and pH) at which 50% of the target sequence hybridizes to a 
perfectly matched probe. Very stringent conditions are 
selected to be equal to the T^^ point for a particular probe. 
Nucleic acids which do not hybridize to each other under 
stringent conditions are still substantially identical if the 
polypeptides which they encode are substantially identical. 
This occurs, e.g*, when a copy of a nucleic acid is created 
using the maximum codon degeneracy permitted by the genetic 
code * 

The term "identical" in the context of two nucleic 
acid or polypeptide sequences refers to the residues in the 
two sequences which are the same when aligned for maximum 
correspondence over a specified comparison window. When 
percentage of sequence identity is used in reference to 
proteins or peptides it is recognized that residue positions 
which are not identical often differ by conservative amino 
acid substitutions, where amino acid residues are siibstituted 
for other amino acid residues with similar chemical properties 
(e.g. charge or hydrophobicity) and therefore do not change 
the functional properties of the molecule. Where sequences 
differ in conservative substitutions, the percent sequence 
identity may be adjusted upwards to correct for the 
conservative nature of the substitution. Means for making 
this adjustment are well-known to those of skill in the art. 
Typically this involves scoring a conservative sxabstitution as 
a partial rather than a full mismatch, thereby increasing the 
percentage sequence identity. Thus, for example, where an 
identical simino acid is given a score of 1 and a 
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non- conservative substitution is given a score of zero, a 
conservative substitution is given a score between zero and i. 
The scoring of conservative substitutions is calculated, e.g-,, 
according to the algorithm of Meyers and Miller, Computer 
Applic. Biol. Sci., 4: 11-17 (1988) e.^., as implemented in 
the program PC/GENE ( Intel 1 igenet i cs , Mountain View, 
California, USA) , 

Methods of alignment of sequences for comparison are 
well-known in the art. Optimal alignment of sequences for 
comparison may be conducted by the local homology algorithm of 
Smith and Waterman (1981) Adv. Appl . Mstth. 2: 482; by the 
homology alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48: 443; by the search for similarity method of 
Pearson and Lipman (1988) Proc. Natl. Acad. Scl. USA 85: 2444; 
by computerized in^lementations of these algorithms 
(including, but not limited to CLUSTAL in the PC/Gene program 
by Intelligenetics, Mountain View, California, GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group (GCG) , 575 Science Dr., Madison, 
Wisconsin, USA) ; the CLUSTAL program is well described by 
Higgins ^nd Sharp (198^) Gene^ 73: 237^44 and Higgins and 
Sharp (1989) Computer Applications in Biosciexices si 

151-153; Corpet, et al . (1988) Nucleic Acids Research 16 , 
10881-90; Huang, et al . (1992) Computer- Applications in the 
Biosciences 8, 155-65, and Pearson, et al. (1994) Methods in 
Molecular Biology 24, 307-31. Alignment is also often 
performed by inspection and manual alignment. 

The term "substantial identity" or "substantial 
similarity in the context of a polypeptide indicates that a 
polypeptide comprises a sequence with at least 70% sequence 
identity to a reference sequence, or preferably 80%, or more 
preferably 85% sequence identity to the reference sequence, or 
most preferably 90% identity over a comparison window of about 
10-20 amino acid residues. An indication that two polypeptide 
sequences are substantially identical is that one peptide is 
immunologically reactive with antibodies raised against the 
second peptide. Thus, a polypeptide is substantially 
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identical to a second polypeptide, for example, where the two 
peptides differ only by a conservative substitution. 

One indication that two nucleic acid sequences are 
substantially identical is that the polypeptide which the 
first nucleic acid encodes is iiranunologically cross reactive 
with the polypeptide encoded by the second nucleic acid. 

Another indication that two nucleic acid sequences 
are substantially identical is that the two molecules 
hybridize to each other under stringent conditions. Stringent 
conditions are sequence dependent and are different under 
different environmental parameters. Generally, stringent 
conditions are selected to be about 5° C to 20°C lower than 
the thermal melting point {T„) for the specific sequence at a 
defined ionic strength and pH. The T„ is the temperature 
(under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. 
However, nucleic acids which do not hybridize to each other 
under stringent conditions are still substantially identical 
if the polypeptides which they encode are substantially 
identical. This occurs, e.g., when a copy of a nucleic acid 
is created using the maximum codon degeneracy permitted by the 
genetic code. 

The term "specifically deliver" as used herein 
refers to the preferential association of a molecule with a 
cell or tissue bearing a particular target molecule or marker 
and not to cells or tissues lacking that target molecule. It 
is, of course, recognized that a certain degree of non- 
specific interaction may occur between a molecule and a non- 
target cell or tissue. Nevertheless, specific delivery, may 
be distinguished as mediated through specific recognition of 
the target molecule. Typically specific delivery results in a 
much stronger association between the delivered molecule and 
cells bearing the target molecule than between the delivered 
molecule and cells lacking the target molecule. Specific 
delivery typically results in greater than 2 fold, preferably 
greater than 5 fold, more preferably greater than 10 fold and 
most preferably greater than 100 fold increase in amount of 
delivered molecule (per unit time) to a cell or tissue bearing 
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the target molecule as compared to a cell or tissue lacking 
the target molecule or marker. 

The term "residue" as used herein refers to an amino 
acid that is incorporated into a polypeptide. The amino acid 
may be a naturally occurring amino acid and, unless otherwise 
limited, may encompass known analogs of natural amino acids 
that can function in a similar manner as naturally occurring 
amino acids. 

A "fusion protein" or when a molecule is "joined" to 
another refers to a chimeric molecule formed by the joining of 
two or more polypeptides through a peptide bond formed between 
the amino terminus of one polypeptide and the carboxyl 
terminus of another polypeptide. The fusion protein or the 
joined molecules may be formed by the chemical coupling of the 
constituent molecules or it may be expressed as a single 
polypeptide from a nucleic acid sequence encoding a single 
contiguous fusion protein. A single chain fusion protein is a 
fusion protein having a single contiguous polypeptide 
backbone . 

A "ligand" or a "ligand binding moiety", as used 
herein, refers generally to all molecules capable of 
— specifically deriverihg"ar molecule, ^ reactTng with~or"otherwise 
recognizing or binding to a receptor on a target cell. 
Specifically, examples of ligands include, but are not limited 
to, antibodies, lymphokines, cytokines, receptor proteins such 
as CD4 and CD8, solubilized receptor proteins such as soluble 
CD4, hormones, growth factors, and the like which specifically 
bind desired target cells. 

Making rOnc-derived Nucleic Acid s and Polvpeptid^a . 

Several specific nucleic acids encoding rOnc- derived 
polypeptides are described herein. These nucleic acids can be 
made using standard recombinant or synthetic techniques. 
Given the nucleic acids of the present invention, one of skill 
can construct a variety of clones containing functionally 
equivalent nucleic acids, such as nucleic acids which encode 
the same polypeptide. Cloning methodologies to accomplish 
these ends, and sequencing methods to verify the sequence of 
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nucleic acids are well known in the art. Examples of 
appropriate cloning and sequencing techniques, and 
instructions sufficient to direct persons of skill through 
many cloning exercises are found in Berger and Kimniel, Guide 
to Molecular Cloning Techniques, Methods in Bnzymology volume 
152 Academic Press, Inc., San Diego, CA (Berger); Sambrook 
et al. (1989) Molecular Cloning - A Laboratory Manual (2nd 
ed.) Vol. 1-3; and Current Protocols in Molecular Biology, 
P.M. Ausubel et al.. eds.. Current Protocols, a joint venture 
between Greene Publishing Associates, Inc. and John Wiley & 
Sons, Inc., (1994 Supplement) (Ausubel). Product information 
from manufacturers of biological reagents and experimental 
equipment also provide information useful in known biological 
methods. Such manufacturers include the SIGMA chemical 
company (Saint Louis, MO) , R&D systems (Minneapolis, MN) , 
Pharmacia LKB Biotechnology (Piscataway, NJ) , CLONTECH 
Laboratories. Inc. (Palo Alto, CA) . Chem Genes Corp., Aldrich 
Chemical Company (Milwaukee, WI) , Glen Research, Inc., GIBCO 
BRL Life Technologies, Inc. (Gaithersberg, MD) , Fluka 
Chemica-Biochemika Analytika (Fluka Chemie AG, Buchs, 
Switzerland) , Invitrogen, San Diego, CA, and Applied 
Biosystems (Foster City, CA) , as well as many other commercial 
sources known to one of skill. 

The nucleic acid compositions of this invention, 
whether RNA, cDNA, genomic DNA, or a hybrid of the various 
combinations, are isolated from biological sources or 
synthesized in vitro. The nucleic acids of the invention are 
present in transformed or transfected cells, in transformed or 
transfected cell lysates, or in a partially purified or 
substantially pure form. 

In vitro an^jlif ication techniques suitable for 
amplifying sequences for use as molecular probes or generating 
nucleic acid fragments for subsequent subcloning are known. 
Examples of techniques sufficient to direct persons of skill 
through such in vitro amplification methods, including the 
polymerase chain reaction (PGR) the ligase chain reaction 
(LCR) , Q^-replicase amplification and other RNA polymerase 
mediated techniques (e.g., NASBA) are found in Berger, 
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Sambrook et al. (1989) MoleculaLr Cloning - a Laboratory m^uslI 
(2nd Ed) Vol. 1-3; amd Ausubel, as well as Mullis et a J . , 
(1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to 
Methods and Applications (Innis et al . eds) Academic Press 
Inc. San Diego, CA (1990) (Innis); Amheim & Levinson (October 
1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3, 
81-94; (Kwoh et al . (1989) Proc. i^^atl. Acad. Sci . USA 86, 
1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 
1874; Lomell et al. (1989) J. Clin. Chem 35, 1826; Landegren 
et al., (1988) Science 241, 1077-1080; Van Brunt (1990) 
Biotechnology 8, 291-294; Wu and Wallace, (1989) Gene 4, 560; 
Barringer et al . (1990) Gene 89, 117, and Sooknanan andMalek 
(1995) Biotechnology 13: 563-564. Improved methods of cloning 
in vitro amplified nucleic acids are described in Wallace et 
al., U.S. Pat. No. 5,426,039. 

Oligonucleotides for use as probes, e.g., in in 
vitro rOnc nucleic acid amplification methods, or for use as 
nucleic acid probes to detect rOnc nucleic acids are typically 
synthesized chemically according to the solid phase 
phosphoramidite triester method described by Beaucage and 
Caruthers (1981), Tetrahedron Letts . , 22 (20) : 1859 - 1862 , e.g., 
-usi-ng-an -automated syiithes-rzer,~e".g.',"^as described" in 
Needham-VanDevanter et al . (1984) Nucleic Acids Res.. 
12:6159-6168. Oligonucleotides can also be custom made and 
ordered from a variety of commercial sources known to persons 
of skill. Purification of oligonucleotides, where necessary, 
is typically performed by either native acrylamide gel 
electrophoresis or by anion- exchange HPLC as described in 
Pearson and Regnier (1983) J. Chrom. 255:137-149. The 
sequence of the synthetic oligonucleotides can be verified 
using the chemical degradation method of Maxam and Gilbert 
(1980) in Grossman and Moldave (eds.) Academic Press, New 
York, Methods in Enzymology 65:499-560. 

One of skill will recognize many ways of generating 
desired alterations in a given nucleic acid sequence. Such 
well-known methods include site-directed mutagenesis, PCR 
amplification using degenerate oligonucleotides, exposure of 
cells containing the nucleic acid to mutagenic agents or 
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radiation, chemical synthesis of a desired oligonucleotide 
(e.g., in conjunction with ligation and/or cloning to generate 
large nucleic acids) and other well-knovm techniques. See, 
Giliman and Smith (1979) Gene 8:81-97; Roberts et al . (1987) 
iyrature 328:731-734 and Sambrook et al. (1989) Molecular 
Cloning' - A IiaJbora tojrjr Majiual (2nd Ed) Vol, 1-3; innis, 
Ausbel, Berger, Needham VanDevanter and Mullis (all supra). 

Polypeptides of the invention can be synthetically 
prepared in a wide variety of well -known ways. Polypeptides 
of relatively short size are typically synthesized in solution 
or on a solid support in accordance with conventional 
techniques. See, e.g., Merrifield (1963) J. Am. Chem. Soc. 
85:2149-2154- Various automatic synthesizers and sequencers 
are commercially available and can be used in accordance with 
known protocols. See, e.g., Stewart and Young (1984) Solid 
Phase Peptide Syntihesls, 2d. ed. , Pierce Chemical Co. 
Polypeptides are also produced by recombinant expression of a 
nucleic acid encoding the polypeptide followed by purification 
using standard techniques. 

Making Conservative Modificatio ns of the Nucleic Acids and 
Polypeptides of the invention, 

One of skill will appreciate that many conservative 
variations of the sequences disclosed yield a substantially 
identical rOnc. For example, due to the degeneracy of the 
genetic code, "silent substitutions" (i.e., substitutions of a 
nucleic acid sequence which do not result in an alteration in 
an encoded polypeptide) are an implied feature of every 
nucleic acid sequence which encodes an amino acid. Similarly, 
conservative amino acid substitutions, in one or a few amino 
acids in an sutiino acid sequence are substituted with different 
amino acids with highly similar properties (see, the 
definitions section, supra) , are also readily identified as 
being highly similar to a disclosed amino acid sequence, or to 
a disclosed nucleic acid sequence which encodes an amino acid. 
Such conservatively substituted (or modified) variations of 
each explicitly disclosed sequence are a feature of the 
present invention . 
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One of skill will recognize many ways of generating 
alterations in a given nucleic acid sequence. Such well-lcnown 
methods include site-directed mutagenesis, PGR amplification 
using degenerate oligonucleotides, exposure of cells 
containing the nucleic acid to mutagenic agents or radiation, 
chemical synthesis of a desired oligonucleotide (e.g., in 
conjunction with ligation and/or cloning to generate large 
nucleic acids) and other well-known techniques. See, Giliman 
and Smith (1979) Gene 8:81-97, Roberts et al . (1987) Nature 
328:731-734 and Sambrook, Innis, Ausbel, Berger. Needham 
VanDevanter and Mullis (all supra) . 

Most commonly, polypeptide sequences are altered by 
changing the corresponding nucleic acid sequence and 
expressing the polypeptide. However, polypeptide sequences 
are also optionally generated synthetically using commercially 
available peptide synthesizers to produce any desired 
polypeptide (see, Merrifield, and Stewart and Young, silpra) . 

One of skill can select a desired nucleic acid or 
polypeptide of the invention based upon the sequences provided 
and upon knowledge in the art regarding ribonucleases 
generally. The physical characteristics and general 
properties of ^ases are known to skilled practitioners. The 
specific effects of some mutations in RNases are known. 
Moreover, general knowledge regarding the nature of proteins 
and nucleic acids allows one of skill to select approf^riate 
sequences with activity similar or equivalent to the nucleic 
acids and polypeptides disclosed in the sequence listings 
herein. The definitions section herein describes exen^lary 
conservative amino acid substitutions. 

Finally, most modifications to nucleic acids and 
polypeptides are evaluated by routine screening techniques in 
suitable assays for the desired characteristic. For instance, 
changes in the immunological character of a polypeptide can be 
detected by an appropriate immunological assay. Modifications 
of other properties such as nucleic acid hybridization to a 
target nucleic acid, redox or thermal stability of a protein, 
thermal histeresis, hydrophobicity, susceptibility to 
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proteolysis, or the tendency to aggregate are all assayed 
according to standard techniques. 

rpnc Fvsion proteins and Other T y ierapenMc Mn-i -i , 

The rOnc molecules may also include pharmacological 
agents or encapsulation systems containing various 
pharmacological agents. They typically will include a ligand 
to act as a targeting molecule to direct the rOnc to desired 
cells. The rOnc may be attached directly to a ligand or an 
antisense molecule which will assist in delivering the rOnc. 
See, for example, SEQ ID NOS:40-6l. The rOnc can also be 
engineered to contain a nuclear localization signal ("NLS") 
such as that described in amino acid positions 1 to 7 in SEQ 
ID NO: 32 (and SEQ ID NO: 31) to direct the rOnc within the 
cell. Alternatively, the Met at position 8 and the 
corresponding nucleic acids at positions 22-24 of SEQ ID NO: 31 
has been and can be omitted. The nucleic acid sequence for 
the NLS is nucleic acids 1-21 of SEQ ID NO: 31. A signal 
peptide is also exemplified at amino acid positions 1-25 of 
SEQ ID NO: 63. 

The rOnc molecules are uniquely adapted for gene 
therapy applications. They can be fused to other therapeutic 
agents, for example, they could be fused to an anti-B cell 
lymphoma antibody. For exan^le, as will be explained in more 
detail below, rOnc molecules recombinantly fused to an anti- 
transferrin receptor antibody or an anti- colon cancer antibody 
were active. As mentioned cODOve, nOnc has anti -tumor effects 
in vivo and preferentially kills rapidly dividing cells 
stimulated by serum or growth promoting agents such as ras . 
The molecules are readily internalized in the cell. Their 
activity can be further facilitated by joining them to a 
nuclear localization signal and the like to redirect the 
molecules within the cell. Of particular use in tximor cells 
would be to target telomerase, an enzyme subject to 
degradation by RNase. 

We have found that One synergizes with ras in 
microinjection studies. This means that One and ras have to 
be together in the cell. One does not synergize with ras when 
it enters the cell via its own routing. A CAAX (SEQ ID NO: 33) 
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motif is required to localize ras at the plasma membrane 
(C=Cys, A = an aliphatic amino acid, X = S,M,C,A, or Q, an 
example is Cys-Val- lie-Met (SEQ ID N0:34)). Importantly this 
type of sequence has been shown to target heterologous 
proteins to the plasma membrane (Hancock, J., Cadwallader. K. , 
Paterson, H. and C. Marshall (1991) EMBO J. 10:4033). It 
would be desirable to join the rOnc gene with DNA encoding a 
CAAX (SEQ ID NO: 33) signal as given in the example, or KDEL as 
described below. 

Telomerase is being investigated as a "universal 
cancer target" (G.B. Morin, JNCI. (1995) 87:859). it is an 
RNA protein that is located in the nucleus. It has been shown 
that antisense to telomerase RNA can inhibit the function of 
the enzyme and block the growth of cancer cells (J. Feng 
et al.. Science (1995) 269:1236). RNase can also destroy the 
activity of the enzyme. One can also destroy the activity of 
the enzyme when incubated with a cell extract containing • 
telomerase. An NLS/Onc molecule (such as that set out in SEQ 
ID NO: 32) can be made to route One to the nucleus so that it 
can degrade telomerase. The NLS we used has been shown to 
redirect proteins to the nucleus for the aim of interfering 
—with the- function- of a -nucleaiT anfigeiT Is 7 Biobca M."T7 
Neuberger and A. Cattaneo. (1990) 9:101). Our NLS/Onc 
molecule is effective in killing cells. 

An amino terminal sequence to the recombinant 
molecule may be preferred where it is desirable to translocate 
the molecule into the cytosol of target cells. Such signal 
peptide is typically inserted at the amino end of the protein. 
For example, the first amino acids of the recombinant 
molecules described herein (after Met) could be KDEL (SEQ ID 
NO:64) and would accomplish signalling the molecule to the 
endoplasmic reticulum. Amino acid sequences which include 
KDEL, repeats of KDEL, or other sequences that function to 
maintain or recycle proteins into the endoplasmic reticulum, 
referred to here as "endoplasmic retention sequences" may be 
employed. 

Optionally, the rOnc molecule attached to a ligand 
may include an encapsulation system, such as a liposome or 



BNSDOCID: <WO. 



) 9731116A2J_> 



wo 97/3 1 1 16 PCT/US97/02588 

24 

micelle that contains an additional therapeutic composition 
such as a drug, a nucleic acid {e.g. an antisense nucleic 
acid) , or another therapeutic moiety that is preferably 
shielded from direct exposure to the circulatory system. 
Means of preparing liposomes attached to antibodies are well 
known to those of skill in the art. See, for example, U.S. 
Patent No. 4,957,735, Connor et al . , Pharm. Ther. , 28: 341-365 
(1985) . 

One of skill will appreciate that the ligand 
molecule or other therapeutic component and the rOnc molecule 
may be joined together in any order. Thus, where the ligand 
is a polypeptide, the rOnc molecule may be joined to either 
the amino or carboxy termini of the ligand or may also be 
joined to an internal region of either molecule as long as the 
attachment does not interfere with the respective activities 
of the molecules . 

The molecules may be attached by any of a number of 
means well-known to those of skill in the art. Typically the 
rOnc will be conjugated, either directly or through a linker 
(spacer) , to the ligand. However, where both the rOnc and the 
ligand or other therapeutic are polypeptides it is preferable 
to recombinantly express the chimeric molecule as a single - 
chain fusion protein. 

In one embodiment, the rOnc molecule is chemically 
conjugated to another molecule (e.gr. a cytotoxin, a label, a 
ligand, or a drug or liposome) . Means of chemically 
conjugating molecules are well-known to those of skill. 

The procedure for attaching an agent to an antibody 
or other polypeptide targeting molecule will vary according to 
the chemical structure of the agent. Polypeptides typically 
contain a variety of functional groups; e.g-., carboxylic acid 
(COOH) or free amine (-NHj) groups, which are availcible for 
reaction with a suitable functional group on an rOnc molecule 
to bind the other molecule thereto. 

Alternatively, the ligand and/or rOnc molecule may 
be derivatized to expose or attach additional reactive 
functional groups. The derivatization may involve attachment 
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of any of a number of linker molecules such as those available 
from Pierce Chemical Company, Rockford Illinois. 

A "linker", as used herein, is a molecule that is 
used to join two molecules. The linker is capable of forming 
covalent bonds to both molecules. Suitable linkers are well 
known to those of skill in the art and include, but are not 
limited to, straight or branched- chain carbon linkers, 
heterocyclic carbon linkers, or peptide linkers. Where both 
molecules are polypeptides, the linkers may be joined to the 
constituent amino acids through their side groups (e.g-,, 
through a disulfide linkage to cysteine) . However, in a 
preferred embodiment, the linkers will be joined to the alpha 
carbon amino and carboxyl groups of the terminal amino acids. 

A bi functional linker having one functional group 
reactive with a group on a particular agent, and another group 
reactive with an antibody, may be used to form a desired 
immunocon jugate. Alternatively, derivatization may involve 
chemical treatment of the ligand, e.gr. , glycol cleavage of the 
sugar moiety of a glycoprotein antibody with periodate to 
generate free aldehyde groups. The free aldehyde groups on 
the antibody may be reacted with free amine or hydrazine 

-groups- on an-agent— to bind the agentT thereto ' (See U.S . 
Patent No. 4,671,958). Procedures for generation of free 
sulfhydryl groups on polypeptides, such as antibodies or 
antibody fragments, are also known (See U.S. Pat. No. 

4,659,839) . 

Many procedure and linker molecules for attachment 
of various confounds including radionuclide metal chelates, 
toxins and drugs to proteins such as antibodies are known. 
See, for exart^le, European Patent Application No. 188,256; 
U.S. Patent Nos. 4,671,958, 4,659,839, 4,414,148, 4,699,784; 
4,680,338; 4,569,789; and 4,589,071; and Borlinghaus et al . 
Cancer Res. 47: 4071-4075 (1987) which are incorporated herein 
by reference. In particular, production of various 
immunotoxins is well-known within the art and can be found, 
for example in "Monoclonal Antibody -Toxin Conjugates; Aiming 
the Magic Bullet," Thorpe et al.. Monoclonal AntiJoodies in 
Clinical Medicine, Academic Press, pp. 168-190 (1982), 
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Walctoann, Science, 252: 1657 (1991), U.S. Patent Nos. 
4,545,985 and 4,894,443 which are incorporated herein by 
reference . 

In some circumstances, it is desirable to free the 
rOnc from the ligand when the chimeric molecule has reached 
its target site. Therefore, chimeric conjugates comprising 
linkages which are cleavable in the vicinity of the target 
site may be used when the effector is to be released at the 
target site. Cleaving of the linkage to release the agent from 
the ligand may be prompted by enzymatic activity or conditions 
to which the immunocon jugate is sxibjected either inside the 
target cell or in the vicinity of the target site. When the 
target site is a tvimor, a linker which is cleavable under 
conditions present at the tumor site (e.^-. when exposed to 
tumor -associated enzymes or acidic pH) may be used. 

A number of different cleavable linkers are known to 
those of skill in the art. See U.S. Pat. Nos. 4,618,492; 
4,542,225, and 4,625,014. The mechanisms for release of an 
agent from these linker groups include, for example, 
irradiation of a photolabile bond and acid- catalyzed 
hydrolysis. U.S. Pat. No. 4,671,958, for example, includes a 
description of immunoconjugates comprising linkers which are 
cleaved at the target site in vivo by the proteolytic enzymes 
of the patient's complement system. In view of the large 
number of methods that have been reported for attaching a 
variety of radiodiagnostic compounds, radiotherapeutic 
compounds, drugs, toxins, and other agents to antibodies one 
skilled in the art will be able to determine a suitable method 
for attaching a given agent to an antibody or other 
polypeptide. 

Production of rOnc Mole cules or Fusion Proteins 

Where the molecules of interest are relatively short 
(i.e., less than about 50 amino acids) they may be synthesized 
using standard chemical peptide synthesis techniques. Where 
two molecules of interest are relatively short the chimeric 
molecule may be synthesized as a single contiguous 
polypeptide. Alternatively the molecules may be synthesized 
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separately and then fused by condensation of the amino 
terminus of one molecule with the carboxyl terminus of the 
other molecule thereby forming a peptide bond. Alternatively, 
the molecules may each be condensed with one end of a peptide 
spacer molecule thereby forming a contiguous fusion protein. 

Solid phase synthesis in which the C- terminal amino 
acid of the sequence is attached to an insoluble support 
followed by sequential addition of the remaining amino acids 
in the sequence is the preferred method for the chemical 
synthesis of the polypeptides of this invention. Techniques 
for solid phase synthesis are described by Barany and 
Merrifield, Solid-Phase Peptide Synthesis; pp. 3-284 in The 
Peptides: Analysis, Synthesis, Biology. Vol. 2: Special 
Methods in Peptide Synthesis, Part A., Merrifield, et al . j. 
Am. Chem. Soc, 85: 2149-2156 (1963), and Stewart et al . , 
Solid Phase Peptide Synthesis, 2nd ed. Pierce Chem. Co., 
Rockford, 111. (1984) which are incorporated herein by 
reference. 

In a preferred embodiment, the chimeric fusion 
proteins of the present invention are synthesized using 

recombinant DNA methodology. Generally this involves creating 

a DNA sequence that encodes the fusion proteTn, placing^ the 
DNA in an expression cassette under the control of a 
particular promoter, expressing the protein in a host, 
isolating the expressed protein and, if required, renaturing 
the protein. 

DNA encoding the fusion proteins of this invention, 
as well as the rOnc molecules themselves, may be prepared by 
any suitable method, including, for example, cloning and 
restriction of appropriate sequences or direct chemical 
synthesis by methods such as the phosphotriester method of 
Narang et al , Meth. Enzymol . 68; 90-99 (1979); the 
phosphodiester method of Brown et al., Meth. Enzymol. 68: 
109-151 (1979); the diethylphosphoramidite method of Beaucage 
etai., Tetra. Lett., 22: 1859-1862 (1981); and the solid 
support method of U.S. Patent No. 4,458,066, all incorporated 
by reference herein. 
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Chemical synthesis produces a single- stranded 
oligonucleotide. This may be converted into double- stranded 
DNA by hybridization with a complementary sequence, or by 
polymerization with a DNA polymerase using the single strand 
as a template. One of skill would recognize that while 
chemical synthesis of DNA is limited to sequences of about 100 
bases, longer sequences may be obtained by the ligation of 
shorter sequences. 

Alternatively, subsequences may be cloned and the 
appropriate subsequences cleaved using appropriate restriction 
enzymes. The fragments may then be ligated to produce the 
desired DNA sequence. 

In a preferred embodiment, DNA encoding fusion 
proteins or rOnc molecules of the present invention may be 
cloned using DNA amplification methods such as polymerase 
chain reaction (PGR) . If two molecules are joined together, 
one of skill will appreciate that the molecules may be 
separated by a peptide spacer consisting of one or more amino 
acids. Generally the spacer will have no specific biological 
activity other than to join the proteins or to preserve some 
minimum distance or other spatial relationship between them. 
However, the constituent amino acids of the spacer may be 
selected to influence some property of the molecule such as 
the folding, net charge, or hydrophobicity . 

The nucleic acid sequences encoding the rOnc 
molecules or the fusion proteins may be expressed in a variety 
of host cells, including E. coli, other bacterial hosts, 
yeast, and various higher eukaryotic cells such as the COS, 
CHO and HeLa cells lines and myeloma cell lines. The 
recombinant protein gene will be operably linked to 
appropriate expression control sequences for each host. For 
E. coll this includes a promoter such as the T7, trp, or 
lambda promoters, a ribosome binding site and preferably a 
transcription termination signal. For eukaryotic cells, the 
control sequences will include a promoter and preferably an 
enhancer derived from immunoglobulin genes, SV40, 
cytomegalovirus, etc., and a polyadenylation sequence, and may 
include splice donor and acceptor sequences . 
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The expression vectors or plasmids of the invention 
can be transferred into the chosen host cell by well-known 
methods such as calcium chloride transformation for E. coli 
and calcium phosphate treatment or electroporation for 
mammalian cells. Cells transformed by the plasmids can be 
selected by resistance to antibiotics conferred by genes 
contained on the plasmids, such as the amp, gpt, neo and hyg 
genes . 

Once expressed, the recombinant rOnc or fusion 
proteins can be purified according to standard procedures of 
the art. including ammonium sulfate precipitation, affinity 
columns, column chromatography, gel electrophoresis and the 
like (see, generally, R. Scopes, Protein Purification, 
Springer- Verlag, N.Y. (1982), Deutscher, Methods in 
Enzymology Vol. 182: Guide to Protein Purification. . Academic 
Press, Inc. N.Y. (1990)). Substantially pure compositions of 
at least about 90 to 95% homogeneity are preferred, and 98 to 
99% or more homogeneity are most preferred for pharmaceutical 
uses. Once purified, partially or to homogeneity as desired, 
the polypeptides may then be used therapeutically. 

Accordingly, this invention also provides for host 



cells and expression vectors cortprising the nucleic acid 
sequences described above. 

Further, the present invention includes a method of 
selectively killing cells using a rOnc joined to a ligand to 
create a selective cytotoxic reagent of the present invention. 
The method comprises contacting the cells to be killed with a 
cytotoxic reagent of the present invention having a ligand 
binding moiety that specifically delivers the reagent to the 
cells to be killed. This method of the present invention may 
be used for cell separation in vitro by selectively killing 
unwanted types of cells, for example, in bone marrow prior to 
transplantation into a patient undergoing marrow ablation by 
radiation, for killing leukemia cells or T- cells that would 
cause graft -versus -host disease. 

For methods of use in vivo, preferably the mammalian 
protein of the reagent used in this method is endogenous to 
the species in which the reagent is intended for use. 
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Preferably, for use in humans, the cytotoxic reagent is a 
fusion protein comprising a humanized chimeric antibody and a 
humanized rOnc. Specific in vivo methods of this invention 
include a method for the chemotherapeutic alleviation of 
cancer in mammals comprising administering a cytotoxic amount 
of a selective cytotoxic reagent according to the present 
invention. The methods are particularly useful for treating 
tumors sensitive to the cytotoxic reagents. Tumors of 
particular interest include pancreatic, colon, breast and 
kidney tumors . 

PhanaaceutlGa3 C""r?ff lt:±Qnfl 

The rOnc molecules and fusion proteins employing 
them of this invention are useful for parenteral, topical, 
oral, or local administration, such as by aerosol or 
transdermally, for prophylactic and/or therapeutic treatment. 
The pharmaceutical con^ositions can be administered in a 
variety of unit dosage forms depending upon the method of 
administration. For example, unit dosage forms suitable for 
oral administration include powder, taO^lets, pills, capsules 
and lozenges. It is recognized that the subject molecules and 
fusion proteins and pharmaceutical con^ositions of this 
invention, when administered orally, must be protected from 
digestion. This is typically accomplished either by 
complexing the protein with a composition to render it 
resistant to acidic and enzymatic hydrolysis or by packaging 
the protein in an appropriately resistant carrier such as a 
liposome. Means of protecting proteins from digestion are 
well known in the art. 

The pharmaceutical compositions of this invention 
are particularly useful for parenteral administration, such as 
intravenous administration or administration into a body 
cavity or Ivimen of an organ. The compositions for 
administration will commonly comprise a solution of the 
chimeric molecule dissolved in a pharmaceutical ly acceptable 
carrier, preferably an aqueous carrier. A variety of aqueous 
carriers can be used, e.gr., buffered saline and the like. 
These solutions are sterile and generally free of undesirable 
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matter. These compositions may be sterilized by conventional, 
well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as 
required to approximate physiological conditions such as pH 
adjusting and buffering agents, toxicity adjusting agents and 
the like, for example, sodium acetate, sodium chloride, 
potassium chloride, calcium chloride, sodium lactate and the 
like. The concentration of therapeutic molecule in these 
formulations can vary widely, and will be selected primarily 
based on fluid volumes, viscosities, body weight and the like 
in accordance with the particular mode of administration 
selected and the patient's needs. 

Thus, a typical pharmaceutical composition for 
intravenous administration would be about 0.1 to 10 mg per 
patient per day. Dosages from O.l up to about 100 mg per 
patient per day may be used, particularly when the drug is 
administered to a secluded site and not into the blood stream, 
such as into a body cavity or into a lumen of an organ. Actual 
methods for preparing parenterally administrable compositions 
will be known or apparent to those skilled in the art and are 

publications as J?eining'tdn ' s 
Pharznaceutical Science, 15th ed.. Mack Publishing Company^ 
East on, Pennsylvania (1980) . 

The compositions containing the present rOnc 
molecules or the fusion proteins or a cocktail thereof (i.e., 
with other proteins) can be administered for therapeutic 
treatments. in therapeutic applications, compositions are 
administered to a patient suffering from a disease, in a 
cytotoxic amount, an amount sufficient to kill cells of 
interest- An amount adequate to accomplish this is defined as 
a "therapeutically effective dose." Amounts effective for 
this use will depend upon the severity of the disease and the 
general state of the patient's health. 

Single or multiple administrations of the 
compositions may be administered depending on the dosage and 
frequency as required and tolerated by the patient. In any 
event, the composition should provide a sufficient quantity of 
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the proteins of this invention to effectively treat the 
patient . 

All patents, applications and publications cited 
herein are incorporated by reference herein. The following 
examples are provided for illustrative puirposes only and are 
not to be construed as limiting the invention in any way. 

EXAMPLES 

Example !• Cloning and Eacprossion of rOnc and One conjugates 
with EDN* 

A. Materials. Native ONCONASE® ("nOnc") (SEQ ID NO:l) Ardelt 
et al. (1991) J. Biol. Chem. 256, 245-251 and recombinant 
human EDN ("rEDN") (SEQ ID NO:9) Newton et al . (1994) J Biol 
Chem, 269, 26739-26745 were purified from Rana pipiens 
oocytes, NASCO, Fort Atkinson, WI and Escherichia coli, 
respectively, as described. Antibodies to the denatured 
proteins were prepared by Assay Research, Inc., College Park, 
MD. Reagents for performing PGR, and direct cloning of PGR 
products, were obtained from Perkin- Elmer Gorp., Norwalk, CT 
and from Invitrogen, San Diego, CA respectively. Substrates 
for the ribonuclease assays were purchased from Sigma, St. 
Louis, MO and Boehringer Mannheim, Indianapolis, IN. The 
materials and their sources used in the construction and 
expression of the recombinant proteins as well as the rabbit 
reticulocyte lysate are described by Newton et al . , 
Biochemistiry 35:545 (1996). 

B, PGR Cloning of Onconase. Rana pipiens genomic DNA was 
isolated according to standard procedures using proteinase K 
Maniatis, T. , Fritsch, E,F. & Sambrook, J". Molecular Cloning, 
a laboratory manual {Gold Spring Harbor Laboratoiry, Gold 
Spring Harbor, New York, 19 82) . A series of degenerate 
primers were designed to correspond to amino acids in various 
regions of the published nOnc sequence Ardelt et al. (1991) J. 
Biol. Chem, 256, 245-251. The PGR reaction was performed 
according to the manufacturer's instructions using 15 ^g of 
genomic DNA in 100 /il • All reagents except the DNA were 
combined and incubated at 95*»G for 8 min to inactivate any 
residual proteinase K before the addition of the Taq DNA 
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polymerase. PGR was performed for 40 cycles of denaturation 
at 940c for 1 min, annealing for 2 min at 55 »C and primer 
extension for 2 min at 72 ^C. Several pairs of primers yielded 
products of the expected size. The largest product (252 bp) 
was obtained using the forward primer encoding amino acid 
residues 15-23 (AG<GA)GATGT(GT)GATTG(TC)GATAA(CT) ATCATG) (SEQ 
ID NO: 35) and the reverse primer encoding amino acid residues 
90-98 (TGTGA(AG)AA(CT)CAGGC{AC)CC(TA)GT(GT)CA(CT)1-rT) (SEQ ID 
NO: 36). This fragment was subcloned into pCR*™!! by TA 
cloning and a clone carrying an insert of the appropriate size 
was directly sequenced and found to encode amino acid residues 
16-98 of nOnc ("Rana 9") (SEQ ID N0:2) . The corresponding 
nucleic acid sequence is set out in SEQ ID NO: 37. 

C. Plasmid Construction, Expression, Protein Purification and 
in Vitro Assays. The N- and C- termini of nOnc were 
reconstructed using the PGR technique of splicing by overlap 
extension Horten et al. (199 0) BioTechniques 8, 528-532 with 
amino acid residues 1-15 of nOnc or amino acid residues 1-21 
of EDN at the N- terminal and amino acid residues 99-104 cff 
none at the G- terminal. The assembled genes were inserted 
between the Xbal and BamHl sites of the bacterial expression 
vector, pET-lid, Novagen, Madison, WI. All procedures were 
accomplished essentially as described in Newton et al. (1994) 
J Biol Chem. 269, 26739-26745. The plasmids were expressed in 
BL2l(DE3) E. coli cells as recommended by the supplier, 
Novagen, Madison WI. The fusion proteins were isolated from 
inclusion bodies, denatured, renatured and dialyzed as 
described Newton et al. (1994) J" Biol Chem. 269, 26739-26745 
before being applied to a CM-Sephadex C-50 column, Pharmacia 
Biotech Inc.. Piscataway, NJ. The proteins were eluted with a 
NaGl gradient (0-0. 5M) in 20 mM Tris-HGl, pH 7.5, containing 
10% glycerol. Final purification to >95% was achieved by size 
exclusion chromatography on Sephadex G-lOO equilibrated and 
eluted with 5% formic acid. The proteins were pooled, 
concentrated by amicon ultrafiltration using a YM3 membrane 
(or lyophilized) , Amicon, Beverly, MA and dialyzed against 20 
mM Tris-HGl, pH 7.5, containing 10% glycerol before being assayed, 
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Ribonuclease activity using high molecular weight 
RNA and tRNA was determined following published protocols, 
Newton et stl . (1994) J Neurosci 14, 538-544 at 37^0 through 
the formation of perchloric acid soluble nucleotides following 
published protocols (Newton et a J , (1996) Biochem. 35:545- 
553). With poly (A,C), UpG and poly U, ribonuclease activity 
was assayed spectrophotometrically according to DePrisco et 
al., and Libonati and Florida DePrisco et a J . (1984) 
Biochimica et Bxophysica Acta 788, 356-363, Libonati, M. & 
Floridi, A. (1969) European J. Bioch&v. 8, 81-87. Briefly, 
activity was assayed by measuring the increase in absorbance 
at 260 nm. Incubation mixtures (1 ml of 10 mM imidazole, 0.1 
M NaCl, pH 6.5 or pH 7) contained substrate and appropriate 
amounts of enzyme solution at 25 °C. The in vitro translation 
assay, St. Clair et al . (1987) Proc Natl Acad Scz 84, 
8330-8334, and the cell viability assays, Pearson et al . 
(1991) J Natl Cancer Inst 83, 1386-1391, using the 
(3- [4, 5-Dimethylthiazol-2-yl-2, 5-diphenyltetrazolium bromide; 
Thiazolyl blue] (MTT) Mossman, T, (19 83) J". Immunol. Methods 
65, 55-63 were performed as previously described, 

D. Cloning and Expression of [Met-(-l)] rOnc and rOnc 
clLlmeras. Eight different oligonucleotide primers were 
designed to correspond to specific regions in the primary 
amino acid structure of nOnc, Ardelt et al, (1991) J. Biol. 
Chem. 256, 245-251 and amplification of Rana pipxens genomic 
DNA was carried out in a thermal cycler, as described adDove. 
A primer pair corresponding to amino acid residues 15 to 23 
and 9 0 to 98 of nOnc, respectively, generated a 252 bp 
fragment. That PCR product, here denoted Jeana clone 9, was 
cloned into pCR™!! and sequence analysis confirmed that the 
PCR product encoded One (104 amino acids, total) from amino 
acid residue 16 to 98 (Fig, 1) . 

The entire recombinant One ("rOnc") gene (SEQ ID 
NO: 38) was constructed by PCR extension and cloned into an 
expression vector using methodology previously described 
Newton et al . (1994) J- Biol Chem. 269, 26739-26745. The amino 
and carboxyl termini of rOnc were completed by inserting the 
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first 15 and last 6 amino acid residues of nOnc, respectively, 
The configuration of the semi -synthetic rOnc gene is depicted 
at the top of Fig. 2A. The primers were designed to overlap 
with the DNA sequence of the Rana. clone 9 PGR product. The 
plasmid was expressed in BL21(DE3) E. coll and the 
recombinant protein was isolated from inclusion bodies as 
described in Newton et a J . (1994) J Biol Chem. 269, 
26739-26745 before being applied to a CM Sephadex C-50 column. 
Final purification to >95% was achieved by size exclusion 
chromatography. The rOnc obtained from the bacteria in this 
expression system contains an extra methionine at the amino 
terminal [Met-(-l)] (SEQ ID NO:39) in contrast to the 
authentic pyroglutamyl amino acid residue (<Glu-i) of the 
native protein (SEQ ID NO:l) . 

To humanize [Met-(-l)] rOnc while maintaining the 
alignment of the active site residues (Fig. 2B) , the 
N- terminal of jeana clone 9 was also reconstituted with 
oligonucleotides that coded for the first 21 amino acid 
residues of a human eosinophil RNase, EDN (Fig. 2B, 
rEDN(i.2i)0nc) . PGR cloning can result in sequence errors. 
Indeed^ the DNA sequence of the gene encodJ.ng EDN^^ ^21) P^c 
contained an A to G substitution resulting in a change from 
Asp to Gly at position 26 in the chimera (residue 20 in nOnc) 
and is designated as rEDN(i.2i, rOncGjg in Fig. 2B. Another 
plasmid containing encoding rEDN(i.2i) rOnc was sequenced and 
found to have the correct DNA sequence. Since the mutation 
resulted in the substitution of a charged amino acid with a 
small neutral residue the mutant chimera was also expressed 
and characterized for activity. In addition, [Met- ( - i) ] rOnc 
was mutated at position 20 from Asp to Gly (rOncGly2o, Fi^s • 
2A and B) . 

E. Ribonuclease activity of One, EDN, [Met- ( -1) ] rOac and 
hybrid rOnc proteins. Both nOnc (Lin, J.J., et ai . (1994) 
Biochem Biophys Res Comnun 204, 156-162) and EDN (Saxena et 
aJ. (1992) Biol. Chew. 267, 21982-21986) are potent 

inhibitors of in vitro translation in the rabbit reticulocyte 
lysate by mechanisms that depend upon their respective 
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nucleolytic activities. As depicted in Table the addition 
of nOnc or EDN to a rabbit reticulocyte lysate caused the 
inhibition of protein synthesis as measured by the 
incorporation of [^^S] methionine into acid precipitable 
protein. Whereas both nOnc and EDN inhibited protein 
synthesis with ICgQg of 0,2 and 1.3 ng/ml, respectively, 
(Met- (-1) IrOnc, [Met - ( - 1 ) ] rOncG20 , and rEDN(i.2i) ^OncG2 6 were 
considerably less potent HC^os 9^, 28 and 28 ng/ml , 
respectively) . The least active RNase in this assay was 
rEDN^j^.jD^Oi^c with an IC50 of 1600 ng/ml . Placental 
ribonuclease inhibitor (PRI) binds tightly to EDN and inhibits 
its enzymatic activity, Sorrentino et al . (1992) J". Biol. Chem 
267, 14859-14865, yet nOnc activity is very little affected by 
PRI, Wu, Y.N., et ail. (1993) Journsil of Biological Chemistry 
26B, 10686-10693 and Table 1, despite its homology to EDN and 
other members of the pancreatic RNase superfamily. in this 
regard, it is interesting that the activity of r EDN (3^.21) rOnc 
is, like nOnc, barely affected by PRI while the hybrid RNase 
with the Gly mutation now behaves more like EDN in that its 
activity is significantly inhibited (21 fold) by PRI, 

The ribonuclease activity of these proteins was also 
assessed in assays using high and low molecular weight 
substrates. As shown in Taile 2, EDN and nOnc have different 
substrate specificities consistent with previously published 
results (Ardelt et al . (1991) J. Biol. Chem. 256, 245-251, 
Sorrentino et al . (1992) J. Biol. Chem 261, 14859-14865, 
Ardelt et al . (1994) Protein Sci 3, Suppl* 1, 137). 
Consistent with the results presented in Table 1, 
[Met- (-1) ]rOnc (SEQ ID NO:39) and rEDN^j^.ji) rOnc were much less 
active with all of the substrates (non detectable or very 
little activity under the assay conditions employed) . 
Surprisingly, the Gly containing hybrid protein, manifested 
significant ribonuclease activity especially under conditions 
optimal for EDN enzymatic activity. EDN is more active at a 
neutral pH (Sorrentino et al . (1992) J. Biol. Chem 267, 
14859-14865) and as seen in Table 2 there is a marked increase 
in EDN degradation of tRNA at pH 7.5 compared to pH 6 (42.3 
fold) . Also, behaving like EDN, the Gly- containing hybrid 
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increases in activity with a pH shift from 6 to 7.5 (21.7 
fold) while nOnc loses activity at pH 7.5 consistent with its 
pH optimum that ranges from 6-6,5 (Ardelt et a J . (1991) J. 
Biol. Chem. 256, 245-251, Ardelt et al . (1994) Protein Scl 3, 
Suppl* 1, 137) . The enhanced EDN-like activity of the 
Gly- containing hybrid protein is also evidenced by its 
behavior with poly (A, C) which is an excellent substrate for 
EDN. As seen in Table 2, only rEDN^3^.2i) rOncG2 6 expresses 
almost 50% of the enzymatic activity of EDN with this 
substrate whereas the activity of the other RNases are 
negligible. Similar results were observed with poly(U) . In 
contrast, there was no detectable activity of rEDN or 
rEDN(j^.2i) rOncG26 with UpG, an optimal Onconase substrate 
(Ardelt et a J . (1994) Protein Sci 3, Suppl. 1, 137). in 
summary, both [Met- ( - 1) ] rOnc and rEDN(^.2i) rOnc are less 
enzymatically active than nOnc or rEDN, Although, 
rEDN(3^.2i)rOncG2 6 expresses significant EDN-like enzymatic 
activity when assayed using defined substrates and conditions 
optimal for EDN, it is not as active as EDN in any assay. 
This could result from an impaired enzyme substrate 
interaction or from the use of suboptimal assay conditions for 
this hybrid- enzyme . — 
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Table 1 Activity of [Met-(-l)]rOnc or Hybrid Proteins in the Rabbit Reticulocyte 
Lysate compared to rEDN r nOnc in the Presence r Absence of FRI 



IC»' (ng/ml) 



(')PRI (+)PRI Fold Difference 



nOoc 


0.2 


0,24 


1.2 


rEDN 


1.3 


>40 


>30.7 


[Met-(-l)]TOnc 


96 


140 


1.4 


[Met-(-l)]rOncG20 


28 


24 


0,9 


rEDN(,.2ijrOnc 


1600 


3200 


2 


rEDNo.2i)rOncG26 


28 


600 


21 



•ICjo is the concentration of protein necessary to inhibit protein synthesis by 56% In the 
rabbit reticulocyte lysate. Data points result from the average of at least three assays. 
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Table 2 Activity of RNases on DifTerent Substrates 











RNase Activity 


(units/mg 




Substrate 


Assay pH 


rEDN 


aOac 






rEDN,.2,,OncGas 


Yeast RNA* 


6.0 


6000 


560 


0 01 


Q 

o 


120 


tRNA*-* 


6.0 


1100 


390 


12 


4 


340 


tRNA*** 


7.5 


46000 


60 


50 


130 


7400 


poly (A,0^ 


7.0 


8000 


0.04 


5 


4,5 


3900 


UpG*' 


6.5 


0.05 


0.18 


<0.01 


<0.01 


<0.01 


poly U*" 


7.0 


16.5 


0,15 


0.20 


0.35 


4.5 



les. 



defined as the changes in A^eo pw minute calculated from the slopes of the linear part of the assays. Each 
value is the average of 2-3 assays in separate experiments. 

*'Spectrophotoinetric assays were performed according to Deprisco et al. (1984) and Libonati and Floridi 
(1969) as described in Materials and Methods. Units are defined as the changes in Aj^ per minute 
calculated from the slopes of the linear part of the assays. Each value is the average of two or more 
determinations. 

T;Met-(-l)]rOncG20 had no detectable activity. 



BNSDOCID: <WO. 



> 9731116A2J_> 



wo 97/31116 



40 



PCTAJS97/02588 



P. Inhibition of protein synthesis in four Iniman ttamor cell 
lin s by RNases. The cytotoxic effect of [Met- ( -1) ] rOnc and 
the two hybrid RNases were compared to rEDN and nOnc by 
determining cell viability using the MTT assay. As depicted 
in Fig. 3, nOnc decreased tumor cell viability in all four 
human tumor cell lines. At the concentrations shown, rEDN had 
no effect on the viability of any of the cell lines. In 
contrast to nOnc, [Met- ( -1) ] rOnc as well as [Met- ( - l) ] rOncG2 0 
was consistently less cytotoxic in all four cell lines. Yet, 
rEDN^3L.2i)^OncG26 was more cytotoxic than nOnc in ACHN, human 
renal carcinoma cells and equally cytotoxic in the MDA-MB-231 
human breast carcinoma cell line. Although rEDN^^.^i) ^O^cG2 6 
was less active than nOnc in the SF-539 and HS 578T human 
glioma and breast cancer cell lines, respectively, it was 
still more active than [Met- ( -1) 1 rOnc or rEDN^^ .21, rOnc protein 
containing Asn at position 26. 

G. Structural Analysis of the hybrid RNases. Modeling the 
hybrid RNase was based on the alignment of the structures for 
One (Mosimann S.C., Ardelt W. , James M.N.G., (1994), Refined 
1.7 A X-ray crystallographic structure of P-30 protein, an 
amphibian ribonuclease with anti- tumor activity (J Mol Biol 
236, 1141-1153) and EDN (Mosimann S.C., Newton D.L,, Youle 
R.J., Jcunes M. , X-ray crystallographic structure of 
recombinant eosinophil -derived neurotoxin at 1.83A resolution 
J Mol Biol) . This and subsequent alignments were carried out 
using ALIGN (Satow Y, , Cohen G.H., Padlan E.A., Davies D.R., 
(1986) , J. Mol Biol 190, 593-604) . 

H. Modeling the structures of the hybrid RNases. The 

coordinates for One and EDN were superiti^osed on the basis of 
trace alignment. Residues in conserved zones, particularly 
in the active site, showed very little displacement when 
comparing both structures (global r.m.s.d. of 1.44 A for 90 C** 
atom pairs) . The hybrid protein was modeled by manual 
rebuilding and geometry regularization using TOM (Cambillau 
C, Horjales E., (1987), J. Mol Graph 5, 174-177). 
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Subsequently, the models for rEDN^j^.ji) and rEDN^^^. 

2i)rOncG26 were assigned an overall B- factor of 15 for all 
non- hydrogen atoms and independently subject to 3 00 cycles of 
positional energy minimization with the program XPLOR (Brunger 
A. (1992) XPLOR: a system for X-ray crystallography and NMR. , 
New Haven: Yale University Press) . The minimization yielded 
virtually identical structures in both cases, the highest 
distance based on C" trace alignment being 0.44 for the C" of 
the mutated residue 26. The geometry quality of the final 
models were assessed with PROCHECK (LasJcowski R.a. , MacArthur 
M.W., Moss D.S., Thornton J.M., (1993), J Appl Cryslzallogr 26, 
283-291) . 

The structural basis for the marked differences in 
activity between the Gly and Asp containing hybrid RNases are 
not obvious from modeling these proteins especially since 
residue 26 is distant from the active site. When the highly 
homologous structure of RNase A complexed with a 
pentanucleotide (Pontecilla- Camps J.C., deLorens R. , leDu 
M.H., Cuchillo CM., (1994), J. Biol Chem 269, 21526-21531) 
was superimposed on the structure of the hybrid protein model, 
the nucleotide was observed also to be distant from the region 

of the mutation. - However ,~t he arrangement" Of the' " ~ 

polynucleotide chain in the different RNases does not 
necessarily have to coincide. In the structure of EDN, a 
second sulfate ion was found in addition to the one in the 
active site (Mosimann S.C., Newton D.L., Youle R.J., James M. , 
X-ray crystallographic structure of recombinant eosinophil - 
derived neurotoxin at 1.83A resolution J" Afol Biol). This 
second sulfate is likely replacing a phosphate from the 
nucleotide to be cleaved, but no phosphate ion is located in 
the equivalent position in the RNase A- pentanucleotide 
complex. Moreover, one of the phosphates in this con^jlex is 
forming a salt bridge with Lys-66, a residue which has no 
counterpart in One since it is located in a loop with a 
different topology in both molecules. Thus, whether the 
difference in enzymatic activity between the Asp and Gly 
mutants in the chimera is related to a change in the binding 
affinity for the substrate remains an open question. 
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Although the structural basis for the difference in 
the activities of the two EDN-Onc hybrids is not clear, the * 
EDN-like behavior of the rEDN(3^.2i) rOncG2 6 hybrid can likely be 
attributed to the configuration of the N- terminal region since 
both the pyroglutamic acid in nOnc and Lys-i in EDN are 
located in the area of the active site (Mosimann S.C., Ardelt 
W, , James M.N.G. , (1994), Refined 1.7 A X-ray crystallographic 
structure of P-30 protein, an amphibian ribonuclease with 
anti-tumor activity J- Afol Biol 236, 1141-1153; Mosimann S.C, 
Newton D.L., Youle R,J,, James M, , X-ray crystallographic 
structure of recombinant eosinophil -derived neurotoxin at 
1.83A resolution J Mol Biol). In addition, the introduction 
of a Gly mutation in [Met- ( -1) ] rOnc did not significantly 
affect enzymatic activity. The preference of U over C in the 
Bl subsite of RNase A has been related to the presence of a 
particular residue (Asp-83) (DelCardayre S.B., Raines R,T. , 
(1995) , A residue to residue hydrogen bond mediates the 
nucleotide specificity of ribonuclease A J Afol Biol 252, 328- 
336) . The corresponding residue in nOnc is also an aspartic 
acid (Asp- 67) , while in EDN this position is occupied by a 
histidine (His -82) . EDN is more active toward poly (A,C) , 
suggesting that it "prefers" C in the Bl subsite, possibly 
because it contains a histidine residue as opposed to the 
aspartic acid in nOnc and RNase A. Taken together, this could 
explain the decreased activity of the Gly containing hybrid 
relative to rEDN since, according to this hypothesis, the 
presence of the Asp residue contributed by the rOnc sequence 
would favor the binding of U over C. With regard to the 
difference in PRI inhibition, the superposition between the 
hybrid proteins and RNase A demonstrates that Asp- 26 in the 
EDN-Onc chimeras is in the equivalent position to Asn-27 in 
RNase A that has been reported to be in contact with PRI {Kobe 
B., Deisenhofer J., (1995), Nature 374, 183-186). In 
addition. Asp- 24 in both chimeras is very close to this 
region. Thus, the accumulation of negative charges in this 
area could prevent binding by the inhibitor. If so, the 
substitution of Gly for Asp would decrease the negative charge 
and restore the binding capacity. 
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ExamplB IX. r One -Antibody Fusion Proteins 

Additional rOnc-antibody and ligand proteins have 
been produced and are highly active. E6FB [Met - ( -1) ] SerrOnc is 
an rOnc molecule having the nucleic acid sequence set out in 
SEQ ID NO: 40 and the amino acid sequence set out in SEQ ID 
NO: 41 and includes the Fv sequence from antibody E6, an anti- 
transferrin receptor antibody. See sequences for E6 at amino 
acid positions 1-237 in SEQ ID NO:41. "FB" refers to a linker 
used to link the antibody and the rOnc portion of the molecule 
and is found at nucleic acid positions 712 through 750 in SEQ 
ID N0:40. This molecule includes a Ser at amino acid position 
252 instead of a Glu. E6FB [Met- ( -1) jGlurOnc refers to the 
sequence in SEQ ID NO: 40. Similar hybrid molecules have been 
made. The nucleic acid and amino acid secjuences for Met- 
NLS (signal peptide) -Gin- rOncFBE6 are set out on SEQ ID NOS:42 
and 43. Another E6/rOnc molecule is designated Met-Ser- 
rOncA87FBE6 and is found on SEQ ID NOS:44 and 45. ''A87" 
refers to the fact that an Ala occurs at amino acid position 
87 . 

Met-Ser-rOnc-Ang-FBE6 is set out on SEQ ID N0S:46 

and 47. 

ESFBMet - Ser~r0hc~ is S€tr~6ur~on~SE"Q" Ib^NOS :"4~8 "a^d "49 T 
Met-Glu-rOncFBE6 is set out on SEQ ID NOS:50 and 51. 
Met-S€r-rOncFBE6 is set out on SEQ ID NOS:50 and 5i, 

with the exception that Ser replaces Glu at amino acid 

position 2 . 

M0C31 and MOC162 refer to anti- colon cancer 
antibodies directed against the 17 -i -a pancarcinoma antigen 
which were obtained from Dr. Hennie Hoogenboom. The Pv region 
of these antibodies was fused to rOnc. The nucleic acid and 
amino acid sequences for MetSerrOnc A87 FBM0C31 are set out on 
SEQ ID NOS:52 and 53. The nucleic acid and amino acid 
sequences for M0C31FBMetSerrOnc are set out on SEQ ID NOS : 54 
and 55. The nucleic acid and amino acid sequences for 
MetSerrOncFBMOCiei are set out on SEQ ID NOS: 56 and 57. 

The ligand, IL2 . (interleukin 2) was recombinantly 
fused to rOnc as well. See SEQ ID NOS: 58 and 59 for 
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IL2FBMetSerrOnc. See SEQ ID NOS:60 and 61 for 
MetSerrOncFBIL2 . 

Inhibition of protein synthesis in SF539 cells 
(which bear the transferrin receptor) was measured, as 
described above, for {Met- ( - l) Ser] rOnc. E6PB [Met- ( - 1) Ser] rOnc- 
tMet- ( - 1) Ser] rOnc-AngFBE6 and [Met- (-l)Glu]rOncFBE6 constructs 
and compared with nOnc. The results are shown on Table 3. 
The three E6 constructs, in particular, had a very high level 
of activity -- up to 45 fold difference over the two non-E6 
molecules. See also Figure 5. MetSerOncAng molecule was made 
corresponding to amino acids 1-107 of SEQ ID N0:47, 



RNase 


ICjo (nM) 


Fold Diiference 


nOnc 


10 


1 


[Met-(-l)Ser]rOnc 


8 


NSD 


E6FB[Met-(-l)Ser]rOnc 


0.22 


45 


[Met-(- 1 )Ser]rOnc-AngFBE6 


0.27 


37 


[Met-(- 1 )Glu]rOncFBE6 


0.50 


20 



ceUs. NSD, no significant difference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Rybak, Susanna M. 

Newton, Oianne L. 
Boque, Lluis 
Wlodawer, Alexander 

(ii) TITLE OF INVENTION; Recombinant Ribonuclease Proteins 
(iii) NUMBER OF SEQUENCES: 39 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Tonmsend and Townsend and Crew 

(B) STREET: One Market Plaza, Steuart Street Tower 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94105-1492 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Not yet assigned 

(B) FILING DATE: Not yet assigned 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Weber, Ellen Lauver 

<B) REGISTRATION NUMBER: 32,762 

(C) REFERENCE /DOCKET NUMBER: 015280-244000 

(ix) TELECOMMUNICATION INFORMATION:- 

(A) TELEPHONE: (415) 543-9600 

(B) TELEFAX: (415) 543-5043 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 104 

(D) OTHER INFORMATION: /labels nOnc 

/note= "native ONCONASE (Registered 
Trademark) from Rana pipiens" 



(ix) FEATURE: 

(A) NAME/KEY: Modified- si te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /note= 



'Xaa = pyroglutamic acid" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Asp Trp I^u Thr Phe Gin Lys Lys His lie Thr Asn Thr Ara Asd 
1 5 10 15 

val Asp Cys Asp Asn lie Met Sex Thr Asn Leu Ph His Cys Lys Asp 
20 25 30 

Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys Ala He Cvs 
35 40 45 

Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe Tyr 
50 55 go 

Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu Lys 
" 70 75 80 

Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro Val 
85 90 95 

His Phe Val Gly Val Gly Ser Cys 
100 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1 . . 83 

<D) OTHER INFORMATION: /note= "Rana clone 9 peptide fran Rana 

pipiens genomic DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys Lys 
^5 10 15 

Asp Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
20 25 30 

Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
35 40 45 

Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 
50 55 60 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
65 70 75 80 

Val His Phe 
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<2) INFORMATIOinFOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 «miino acids 

(B) TYPE: aunino acid 

(C) STRANDEDNBSS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /note= "N- terminal sequence of nOnc with 

Glu in place of pyxoglutamic acid in 
posxtion l« 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Glu Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Arg Asp 
15 10 

Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe 
20 25 

f2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



15 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /note= "N- terminal sequence of 

recombinant eosinophil -de rived 
neurotoxin <rEDN) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His He 
^ 5 10 x5 

Asn Met Thr Ser Gin Gin Cys Thr Asn Ala Met Gin Val He Asn Asn 
20 25 30 

Tyr Gin 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FKATDRE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: l.,28 

(D> OTHER INFORMATION: /notes "N- terminal sequence of 

tMet- (-1) ] rOncG20, containing a Gly to 
Asp substitution at position 20 of 
[Met- (-1) ) rOnc, and without the extra 
N- terminal Met from the E. coli 
bacterial expression system" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Arg Asp 
15 10 15 

Val Asp Cys Gly Asn He Met Ser Thr Asn Leu Phe 
20 25 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . .34 

(D) OTHER INFORMATION: /note=: "N-terminal sequence of 

rEDNl-21rOnc, without the extra 
N-terminal Met trcm the E. coli 
bacterial expression system" 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His He 
i 5 10 15 

Asn Met Thr Ser Gin Asp Val Asp Cys Asp Asn He Met Ser Thr Asn 
20 25 30 

Leu Phe 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 auaino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATORE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /not = "N- terminal sequence of 

rEDNl-21rOncG2 6, containing a Gly to Asp 
substitution at position 26 of 
rEDNl-21rOnc, and without the extra 
N- terminal Met from the E. coli 
bacterial expression system" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His He 
15 10 15 

Asn Met Thr Ser Gin Asp Val Asp Cys Gly Asn He Met Ser Thr Asn 
20 25 30 

Leu Phe 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 
{C) STRANDEDNESS : 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..111 

(D) OTHER INFORMATION: /note= "Frog Lectin from Rana 

catesbeiana^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Glu Asn Trp Ala Thr Phe Gin Gin Lys His He He Asn Thr Pro He 
15 10 15 

He Asn Cys Asn Thr He Met Asp Asn Asn He Tyr He Val Gly Glv 
20 25 30 

Gin Cys Lys Arg Val Asn Thr Phe He He Ser Ser Ala Thr Thr Val 
35 40 45 

Lys Ala He Cys Thr Gly Val He Asn Met Asn Val Leu Ser Thr Thr 
50 55 60 

Arg Phe Gin Leu Asn Thr Cys Thr Arg Thr Ser He Thr Pro Arg Pro 

70 75 80 

Cys Pro Tyr Ser Ser Arg Thr Glu Thr Asn Tyr He Cys Val Lys Cvs 
65 90 95 

Glu Asn Gin Tyr Pro Val His Phe Ala Gly He Gly Arg Cys Pro 
100 105 lio 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQDENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 134 

(D) OTHER INFORMATION: /note. "Human eosinophil -derived 

neurotoxin (EDN) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His He 
^ 10 15 

Asn Met Thr Ser Gin Gin Cys Thr Asn Ala Met Gin Val He Asn Asn 

25 30 

Tyr Gin Arg Arg Cys Lys Asn Gin Asn Thr Phe Leu Leu Thr Thr Phe 

40 45 

Ala Asn Val Val Asn Val Cys Gly Asn Pro Asn Met Thr Cys Pro Ser 

55 60 

Asn Lys Thr Arg Lys Asn Cys His His Ser Gly Ser Gin Val Pro Leu 

He His Cys Asn Leu Thr Thr Pro Ser Pro Gin Asn He Ser Asn Cys 
85 90 95 

Arg Tyr Ala Gin Thr Pro Ala Asn Met Phe Tyr He Val Ala Cys Asp 

Asn Arg Asp Gin Arg Arg Asp Pro Pro Gin Tyr Pro Val Val Pro Val 
115 120 125 

His Leu Asp Arg He He 
130 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS: 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



< ix) FEATURE : 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..133 

(D) OTHER INFORMATION: /note=: "Human eosinophil cationic 

protein (ECP) 



,9731 1 1 6A2_t_> 



wo 97/31116 5^ PCT/US97/02588 



3NCE 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Pro Pro Gin Phe Thr Arg Ala Gin Trp Phe Ala He Gin His He 
15 10 15 

Ser l*eu Asn Pro Pro Arg Cys Thr He Ala Met Arg Ala He Asn Asn 
20 25 30 

Tyr Arg Trp Arg Cys Lys Asn Gin Asn Thr Phe Leu Arg Thr Thr Phe 
35 40 45 

Ala Asn Val Val Aen Val Cys Gly Asn Gin Ser He Arg Cys Pro His 
SO 55 60 

Asn Arg Thr Leu Asn Asn Cys His Arg Ser Arg Phe Arg Val Pro lieu 
65 70 75 80 

I»eu His Cys Asp I^eu He Asn Pro Gly Ala Gin Asn He Ser Asn Cys 
85 90 95 

Arg Tyr Ala Asp Arg Pro Gly Arg Arg Phe Tyr Val Val Ala Cys Asp 
100 105 110 

Asn Arg Asp Pro Arg Asp Ser Pro Arg Tyr Pro Val Val Pro Val His 
115 120 125 

Leu Asp Thr Thr He 
130 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

<D) TOPOLOGY: linear 

_ J^i ) MOI£CDLE TyPE^^^ _pro.t:ein_„„ 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..125 

(D) OTHER INFORMATION: /note= "Bovine angiogenin (Ang) " 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Gin Asp Asp Tyr Arg Tyr He His Phe Leu Thr Gin His Tyr Asp 
1 5 10 15 

Ala Lys Pro Lys Gly Arg Asn Asp Glu Tyr Cys Phe His Met Met Lys 
20 25 30 

Asn Arg Arg Leu Thr Arg Pro Cys Lys Asp Arg Asn Thr Phe He His 
35 40 45 

Gly Asn Lys Asn Asp He Lys Ala He Cys Glu Asp Arg Asn Gly Gin 
50 55 60 

Pro Tyr Arg Gly Asp Leu Arg He Ser Lys Ser Glu Phe Gin He Thr 
65 70 75 80 

He Cys Lys His Lys Gly Gly Ser Ser Arg Pro Pro Cys Arg Tyr Glv 
85 90 95 
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Ala Thr Glu Asp Ser Arg Val lie Val Val Gly Cye Glu Asn Gly Leu 
100 105 110 

Pro Val His Phe Asp Glu Ser Phe lie Thr Pro Arg His 
lis 120 125 

(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 
<B) LOCATION: 1..124 

(D) OTHER INFORMATION: /note= "Bovine seminal RNase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Lys Glu Ser Ala Ala Ala Lys Phe Glu Arg Gin His Met Asp Ser Gly 
15 10 15 

Asn Ser Pro Ser Ser Ser Ser Asn Tyr Cys Asn Leu Met Met Cys Cys 
20 25 30 

Arg Lys Met Thr Gin Gly Lys Cys Lys Pro Val Asn Thr Phe Val His 
35 40 45 

Glu Ser Leu Ala Asp Val Lys Ala Val Cys Ser Gin Lys Lys Val Thr 
50 55 60 

Cys Lys Asn Gly Gin Thr Asn Cys Tyr Gin Ser Lys Ser Thr Met Arg 
^5 70 75 80 

He Thr Asp Cys Arg Glu Thr Gly Ser Ser Lys Tyr Pro Asn Cys Ala 
85 90 95 

Tyr Lys Thr Thr Gin Val Glu Lys His He He Val Ala Cys Gly Glv 
100 105 110 

Lys Pro Ser Val Pro Val His Phe Asp Ala Ser Val 
115 120 



<2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1, .124 

(D) OTHER INFORMATION: /note« "Bovine pancreatic RNase A" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gin His Met Asp Ser Ser 
15 10 15 

Thr Ser Ala Ala Ser Ser Ser Asn Tyr Cys Asn Gin Met Met Lys Ser 
20 25 30 

Arg Asn Leu Thr Lys Asp Arg Cys Lys Pro Val Asn Thr Phe Val His 
35 40 45 

Glu Ser Leu Ala Asp Val Gin Ala Val Cys Ser Gin Lys Asn Val Ala 
50 55 60 

Cys Lys Asn Gly Gin Thr Asn Cys Tyr Gin Ser Tyr Ser Thr Met Ser 
65 70 75 80 

lie Thr Asp Cys Arg Glu Thr Gly Ser Ser Lys Tyr Pro Asn Cys Ala 
85 90 95 

Tyr Lys Thr Thr Gin Ala Asn Lys His lie lie Val Ala Cys Glu Gly 
100 105 110 

Asn Pro Val Val Pro Val His Phe Asp .Ala Ser Val 
115 120 * 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(i_x)_FEATURE.: — ^ - — — 

(A) NAME/KEY: Modified- site 
<B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa s= Ser, Tyr or Thr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lys Xaa Pro 
1 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa = Ser, Tyr or Thr" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Lys Pro Xaa 
1 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa « Ser, Tyr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Aen Xaa Pro 

1 



(2) INFORMATION FOR SEQ ID NO: 17; 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified- si te 

(B) LOCATION: 2 

(D) OTHER INFORMATION; /note= "Xaa = Ser, Tyr 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Xaa Lye Pro 
1 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) I«AME/KBY: Modified- si te 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa « Ser, Tyr or Thr" 
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(xi) SEQOENCE DESCRIPTION: SEQ ID NO: 18: 

Met Xaa Pro Lys 

1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,.321 

(D) OTHER INFORMATION: /note= "MetSerOnc86Angl04 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATC TCA GAT TGG err ACA TTT CAG AAA AAA CAC ATC ACA AAC ACA AGG 48 
lie Ser Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Arg 
15 10 xs 

GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC TGC AAG 96 
Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys Lys 
20 25 30 

GAC AAG AAC ACT TTT ATC TAT TCA CGT CCT GAG CCA GTG AAG GCC ATC 144 
Asp Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT GAG TTT 192 

Cys. Lys -Gly- He- He Ala- Ser Lys Asn Val Leu Thr Thr Ser Glu Phe" " 

55 60 

TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT AAA TTA 240 
Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 

'^^ 75 80 

AAG AAA TCA ACT AAT AAA m ^AT CAG GCA CCA 

Lys Lye Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

GTT CAT TTT GTT CAG TCA ATT TTC CGT CGT CCG 
Val His Phe Val Gin Ser He Phe Arg Arg Pro 
100 105 



288 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQtJBNCE DBSCRIPTION: SEQ ID NO:20: 

II Ser Asp Trp Leu Thr Phe Gin Lys Lys His lie Thr Asn Thr Aro 
1 5 10 15 

Abp Val Asp Cys Aep Asn lie Met Ser Thr Asn Leu Phe His Cvb Lvs 
20 25 30 

Asp Lys Asn Thr Phe lie Tyr Ser Arg Pro Glu Pro Val Lys Ala lie 
35 40 45 

Cys Lys Gly lie lie Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 60 

Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 
65 70 75 ao 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

Val His Phe Val Gin Ser He Phe Arg Arg Pro 
100 105 

{2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 t>ase pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1,,333 

(D) OTHER INFORMATION: /note= "EDNGlyOnc" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG AAA CCG CCG CAG TTC ACT TGG GCT CAG TGG TTC GAA ACT CAG CAT 4 8 

Met Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His 
15 10 15 

ATC AAC ATG ACT TCT CAG GAT GTT GAT TGT GGT AAT ATC ATG TCA ACA 96 
lie Asn Met Thr Ser Gin Asp Val Asp Cys Gly Asn lie Met Ser Thr 
20 25 30 

AAC TTG TTC CAC TGC AAG GAC AAG AAC ACT TTT ATC TAT TCA CGT CCT 144 
Asn Leu Phe His Cys Lys Asp Lys Asn Thr Phe lie Tyr Ser Arg Pro 
35 40 45 

GAG CCA GTG AAG GCC ATC TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG 192 
Glu Pro Val Lys Ala He Cys Lys Gly He He Ala Ser Lys Asn Val 
50 55 60 

TTA ACT ACC TCT GAG TTT TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG 240 
Leu Thr Thr Ser Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser Ara 
€5 70 75 80 



CCT TGC AAG TAT AAA TTA AAG AAA TCA ACT AAT AAA TTT TGT GTA ACT 
Pro Cys Lye Tyr Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr 
85 90 95 



268 



BNSDOCID: <WO. 



> 9731 1 1 6A2_I_> 



wo 97/31116 5^ _ PCT/US97/a2588 



TGT GAA AAT CAG TSCA CCA GTT CAT TTT GTT GGA GTT GGA TCT TGT 333 
Cys Glu Asn Gin Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 XIO 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I£NGTH: 111 amino acids 

(B) TYPE : amino acid 
(D> TOPOLOGY: linear 

(ii) MOIiECDIiE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Lys Pro Pro Gin Phe Thr Trp Ala Gin Trp Phe Glu Thr Gin His 
15 10 15 

lie Asn Met Thr Ser Gin Asp Val Asp Cys Gly Asn lie Met Ser Thr 
20 25 30 

Asn I*eu Phe His Cys Lys Asp Lys Asn Thr Phe He Tyr Ser Arg Pro 
35 40 45 

Glu Pro Val Lys Ala He Cys Lys Gly He He Ala Ser Lys Asn Val 
50 55 60 

Leu Thr Thr Ser Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg 
65 70 75 80 

Pro Cys Lys Tyr Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr 
85 90 95 

Cys Glu Asn Gin Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 110 

( 2 )„ INFORMATION -FOR SEQ -ID-NO :-2 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..315 

(D) OTHER INFORMATION: /note= "MetTyrrOnc" 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG TAT GAT TGG CTT ACA TTT CAG AAA AAA CAC ATC ACA AAC ACA AGG 
Met Tyr Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Arg 
1 5 10 15 

GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC TGC AAG 96 
Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys Lys 
20 25 30 



48 
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^^f^^^^^-^^ TAT TCA CGT CCT C3AG CCA GTG AAG GCC ATC 144 
Asp Lys Asn Thr Phe Thr Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT GAG TTT 192 
Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 gQ 

TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT AAA TTA 
Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 



80 



240 



288 



/U^ TCA ACT AAT AAA m AAT CAG GCA CCA 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 

90 95 

GTT CAT TTT GTT GGA GTT GGA TCT TGT 

Val His Phe Val Gly Val Gly Ser Cys ^'^^ 
100 105 



(2) INFORMATION FOR SEQ ID NO: 24: 

( i ) SEQUENCE CHA RACTERISTICS : 

(A) LENGTH: 105 amino acids 

(B) TVPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Tyr Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Ara 
i 5 10 15 ^ 

Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys Lys 
20 25 30 

Asp Lys Asn Thr Phe Thr Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 

55 60 

Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 

75 80 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

Val His Phe Val Gly Val Gly Ser Cys 
100 105 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FKAl 

(A) NAME/KEY: CDS 

(B) liOCATION; 1..315 

(D) OTHER INFORMATION: /note= "MetSerrOnc" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATG TCA GAT TGG CTT ACA TTT CAG AAA AAA CAC ATC ACA AAC ACA AGO 48 
Met Ser Asp Trp Leu Thr Phe Gin Lye Lys His lie Thr Asn Thr Arg 
15 XO 15 

GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC TGC AAG 96 
Asp Val Asp Cys Asp Asn lie Met Ser Thr Asn Leu Phe Hie Cys Lvs 
20 25 30 

GAC AAG AAC ACT TTT ACT TAT TCA CGT CCT GAG CCA GTG AAG GCC ATC 144 
Asp Lys Asn Thr Phe Thr Tyr Ser Arg Pro Glu Pro Val Lys Ala lie 
35 40 45 

TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT GAG TTT 192 
Cye Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 60 

TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT AAA TTA 240 
Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 
65 70 75 80 

AAG AAA TCA ACT AAT AAA TTT TGT GTA ACT TGT GAA AAT CAG GCA CCA 288 
Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

GTT CAT TTT GTT GGA GTT GGA TCT TGT ^^c 
Val His Phe Val Gly Val Gly Ser Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 aunino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Ser Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr Arg 
^ 5 10 15 ^ 

Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe Hie Cys Lvs 
20 25 30 

Asp Lys Asn Thr Phe Thr Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 60 

Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 

70 75 80 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 
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Val His Phe Val Gly Val Gly Ser Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECCTLE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3ia 
(D) OTHER INFORMATION: /note= " Me tLysTyr rOnc " 




PCT/US97/02588 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

ATG AAA TAT GAT TGG CTT ACA TTT CAG AAA AAA CAC ATC ACA AAC ACA 48 
Met Lys Tyr Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr 
i 5 10 15 

AGG GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC TGC 
Arg Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys 
20 25 30 

AAG GAC AAG AAC ACT TTT ATC TAT TCA CGT CCT GAG CCA GTG AAG GCC 
Lys Aap Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys Ala 
35 40 45 

ATC TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT GAG 
He Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu 
50 55 60 

TTT TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT AAA 
Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys 

70 75 80 

TTA AAG AAA TCA ACT AAT AAA TTT TGT GTA ACT TGT GAA AAT CAG GCA 
Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala 
85 90 95 

CCA GTT CAT TTT GTT GGA GTT GGA TCT TGT 
Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Lys Tyr Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn Thr 
1 5 10 15 

Arg Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cvs 
20 25 30 ^ 



96 
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D 1 

Lys Asp t,ys Asn Thr Phe He Tyrr Ser Arg Pro Glu Pro Val Lys Ala 
35 40 45 

He Cys Lys Gly He He Ala Ser Lys Asn Val liCu Thr Thr Ser Glu 
50 55 60 

Ph Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys 
65 70 75 80 

Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala 
85 90 95 

Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..321 

(D) OTHER INFORMATION: /note« "MetAlaAlaTyrrOnc" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;29: 

ATG GCT GCT TAT GAT TGG CTT ACA TTT CAG AAA AAA CAC ATC ACA AAC 48 
Met Ala Ala Tyr Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Asn 

_1 15 



ACA AGG GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC 
Thr Arg Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His 
20 25 30 



96 



TGC AAG GAC AAG AAC ACT TTT ATC TAT TCA CGT CCT GAG CCA GTG AAG 144 
Cys Lys Asp Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys 
35 40 45 

GCC ATC TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT 192 
Ala He Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser 
50 55 60 

GAG TTT TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT 240 
Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lvs Tvr 
^5 70 75 80 

AAA TTA AAG AAA TCA ACT AAT AAA TTT TGT GTA ACT TGT GAA AAT CAG 288 
Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin 
85 90 95 

GCA CCA GTT CAT TTT GTT GGA GTT GGA TCT TGT 321 
Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met Ala Ala Tyr Asp Trp Leu Thr Phe Gin Lye Lys His He Thr Aan 

^ 10 15 

Thr Arg Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His 

25 30 

Cys Lys Asp Lys Asn Thr Phe He Tyr Ser Axg Pro Glu Pro Val Lys 
•^^ 40 45 

Ala lie Cys Lys Gly He lie Ala Ser Lys Asn Val Leu Thr Thr Ser 

S5 60 

Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr 
*^ 70 75 ' 80 

Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin 
85 90 95 

Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 33 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..336 

(D) OTHER INFORMATION: /note^ "NLSMetSerrOnc" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCC AAG AAO AAG CGG AAG GTG ATG TCA GAT TGG CTT ACA CTT CAG AAA 
Pro Lys Lys Lys Arg Lys Val Met Ser Asp Trp Leu Thr Phe Gin Lys 
^ S 10 X5 

AAA CAC ATC ACA AAC ACA AGG GAT GTT GAT TGT GAT AAT ATC ATG TCA 
Lys Has He Thr Asn Thr Arg Asp Val Asp Cys Asp 721 ?IS £1? 12 
20 25 30 

ACA AAC TTG TTC CAC TGC AAG GAC AAG AAC ACT TTT ATC TAT TCA CGT 
Thr Asn Leu Phe His Cys Lys Asp Lys Asn Thr Phe I le T^ 

CCT GAG CCA GTG AAG GCC ATC TGT AAA GGA ATT ATA GCC TCC AAA AAT 
Pro Glu Pro Val Lys Ala lie Cys Lys Gly He lie Ser 



48 



96 



144 



192 



BNSDOCID: <WO 9731 1 1 6A2 J_> 



wo 97/31116 _ PCnr/US97/02588 

GTG TTA ACT ACC TCT GAG TTT TAT CTC TCT GAT TGC AAT GTA ACA AGC 24 0 

val Leu Thr Thr Ser Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser 
65 70 75 80 

AGG CCT TGC AAG TAT AAA TTA AAG AAA TCA ACT AAT AAA TTT TGT GTA 288 
Arg Pro Cys Lys Tyr Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val 
85 90 95 

ACT TGT GAA AAT CAG GCA CCA GTT CAT TTT GTT GGA GTT GGA TCT TGT 33 6 

Thr Cys Glu Asn Gin Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 110 

(2) INFORMATION FOR SEQ ID NO; 32: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Pro Lys Lys Lys Arg Lys Val Met Ser Asp Trp Leu Thr Phe Gin Lys 
IS 10 15 

Lys Hie He Thr Asn Thr Arg Asp Val Asp Cys Asp Asn He Met Ser 
20 25 30 

Thr Asn Leu Phe His Cys Lys Asp Lys Asn Thr Phe He Tyr Ser Arg 
35 40 45 

Pro Glu Pro Val Lys Ala He Cys Lys Gly He He Ala Ser Lys Asn 
50 55 60 

Val Leu Thr Thr Ser Glu Phe Tyr Leu Ser Asp Cys Asn Val Thr Ser 

65 70 75 8_0_ 

Arg Pro Cys Lys Tyr Lys Leu Lys Lys Ser Thr Asn Lys Phe Cys Val 
85 90 95 

Thr Cys Glu Asn Gin Ala Pro Val His Phe Val Gly Val Gly Ser Cys 
100 105 110 

<2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOXiECULE TYPE: peptide 

{ ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= «Xaa = an aliphatic amino acid, 

Ala, Leu, He, Val, or Pro" 
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(ix) FEATURE: 

CA) NAME/KEY: Modif ied-si te 
(B) LOCATION: 3 

(D) OTHER INFORMATION: /note= "Xaa « an aliphatic amino acid, 

Ala, Leu, He, Val or Pro" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa = Ser, Met, Cys, Ala or Gin" 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 33 



Cys Xaa Xaa Xaa 

1 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
Cys Val He Met 



(2) INFORMATION FOR SEQ ID NO; 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AGRGATGTKG ATTGYGATAA YATCATG 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TGTGARAAYC AGGCMCCWGT KCAYTTT 



BNSDOCIO: <WO 9731 1 1 6A2 J_> 



wo 97/31116 _ rrc ^ PCT/yS9 7/Q2S88 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 249 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..249 

(D) OTHER INFORMATION: /notec "Rana 9" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

GATGTTGATT GTGATAATAT CATGTCAACA AACTTGTTCC ACTGCAAGGA CAAGAACACT 60 

TTTATCTATT CACGTCCTGA GCCAGTGAAG GCCATCTGTA AAGGAATTAT AGCCTCCAAA 12 0 

AATGTGTTAA CTACCTCTGA GTTTTATCTC TCTGATTGCA ATGTAACAAG CAGGCCTTGC 180 

AAGTATAAAT TAAAGAAATC AACTAATAAA TTTTGTGTAA CTTGTGAAAA TCAGGCACCA 24 0 
GTTCATTTT 



249 



(2) INFORMATION FOR SEQ ID NO: 38: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i-i )— MOLECULE -TYPE :- DNA— — 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..315 

(D) OTHER INFORMATION: /note^ " [Met - ( - 1 ) ] rOnc" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATG GAG GAT TGG CTT ACA TTT CAG AAA AAA CAC ATC ACA AAC ACA AGG 48 
Met Glu Asp Trp Leu Thr Phe Gin Lys Lys His He Thr Aen Thr Arg 
15 10 15 

GAT GTT GAT TGT GAT AAT ATC ATG TCA ACA AAC TTG TTC CAC TGC AAG 
Asp Val Asp Cys Asp Asn He Met Ser Thr Asn Leu Phe His Cys Lys 
20 25 30 

GAC AAG AAC ACT TIT ATC TAT TCA CGT CCT GAG CCA GTG AAG GCC ATC 144 
Asp Lys Asn Thr Phe He Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

TGT AAA GGA ATT ATA GCC TCC AAA AAT GTG TTA ACT ACC TCT GAG TTT 192 
Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 50 



96 



BNSDOCID: <WO 9731 1 1 6A2J_> 



wo 97/31116 
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TAT CTC TCT GAT TGC AAT GTA ACA AGC AGG CCT TGC AAG TAT AAA 7TA 240 
Tyr Leu Ser Aep Cye Aen Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 

75 80 

AAG AAA TCA ACT AAT AAA TTT TGT GTA ACT TGT GAA AAT CAG GCA CCA 
Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

GTT CAT TTT GTT GGA GTT GGA TCT TGT 
Val His Phe Val Gly Val Gly Ser Cys 
100 X05 

(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUBNCB CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Glu Asp Trp Leu Thr Phe Gin Lys Lys His lie Thr Asn Thr Arq 
1 5 10 15 ^ 

Asp Val Asp Cys Asp Asn lie Met Ser Thr Asn Leu Phe His Cys Lys 
2" 25 30 

Asp Lys Asn Thr Phe lie Tyr Ser Arg Pro Glu Pro Val Lys Ala He 
35 40 45 

Cys Lys Gly He He Ala Ser Lys Asn Val Leu Thr Thr Ser Glu Phe 
50 55 gQ 

Tyr Leu Ser Asp Cys Asn Val Thr Ser Arg Pro Cys Lys Tyr Lys Leu 

70 75 ' 80 

Lys Lys Ser Thr Asn Lys Phe Cys Val Thr Cys Glu Asn Gin Ala Pro 
85 90 95 

Val His Phe Val Gly Val Gly Ser Cys 
100 105 



BNSDOCID: <WO 9731 1 1 6A2 I > 
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DNA sequence 10€5 b.p. GACATCAAGATO ... GTTOGATCTIXrr linear 

; > <H6srv^ 

1/1 31 / U 

GAC ATC AAG ATC ACC CAG TCT CCA TCT TCC ATG lAT CCA TCT CTA OC5A GAG AGA GTC ACT 
asp ile ly^ mec gin ser pro ser ser roec cyr ala ser leu gly glu arc val chr 
€1 / 21 91/31 

TIC ACT TGC AAC GCC ACT CAG GAC ATT AAT AAC TAT TTA TGC TOG TTC CAG CAG AAA CCA 
phe thr cys lys ala ser gin a$p ile asn asn cyr leu cys crp phe gin gin lyc pro 
121 / 41 X51 / 51 

GOG AAA TCT CCT AAG ACC CTG ATC TAT OGT GCA AAC AGA CTG GTA GAT GOG GTC CCA TCA 
gly lys ser pro lys chr leu ile cyr arg ala asn arg leu val asp gly val pro ser 
181 / 61 211 / 71 

AOG TIC AGT GGC ACT OGA TCT GGA CAA GAT TAT TCT CIC ACC ATT AOC AGC CIC GAG TAT 
arg phe ser gly ser gly ser gly gin asp cyr ser leu cnr ile ser ser leu olu cvr 
241 / 81 271 / 91 

GAA GAT ATG GGA ATT TAT TAT TCT CTA CAG TAT GAT GAG TTT CCC TAC ACC TIC OGA GOG 

?iV ?Sf "^"^ "^^^ *«P cyr thr phe gly gly 

301 / 101 331 / 111 

GGGACCAAGCTCGAAATAAAAGGAOOCGGTCGCTCGGGCOCTGGCGGATCGaCTrGCCGGC 

?iT Jof ^^"^ "^^^ oiy ^ly giy 

361 / 121 391 / 131 



gly ser glu val gin leu gin gin ser gly chr val leu ala arp pro gly ala ser val 

451 / 151 

ATC TCC TGC AAG GCT TCT GCC TAC ACC TTT TCC AGC TAC TtM ATC CAC to; ATA AA^ 

Rl fif ^'"^ "^'^ his ^ ile ly^ 

481 / 161 511 / 171 

CAG AOG CCT GGA CAG GGT CTC GAC T(3G ATT GTC GCT ATT GAT CCT CGA AAT ACT GAT ACT 
gin arg pro gly gin gly leu asp crp ile val ala ile asp pro ^ « n ^ 
/ 181 571 / 

ArrTACAACCCGCAATTCAAACACAACCCrAAACTCACTCXACriCACCTCCACCACCACr 

6ot T 20? III III ""^ Chr ^ ch; 

G« TAG ATC GAA CTC AAC AGC CTG ACA AAT GAG GAC TCT GCC GTC TAT TAC TCT ACC CCT 
au cyr mec gju leu asn ser leu chr asn glu asp ser ala val cyr S| J« 

CTT TAT TAC rrr GAC TTTC TGG GCC CAA GGC ^ ACT §^ ACA GTC TCC TCA'GCC^A;!^ AAA 

leu cyr cyr phe asp ser -.rp ,ly gin gjy ttx r Chr l |u Chr val ser s^ Ha 1^ ^ _ _ 

CTG-AAC CA^GCr CAG GCC CCG AAG ACT Wt' Itc GAG ^T TTw" ot'' A^"m CAG AAA AAA " . 

le. asn asP .la .In ala pro lys ser asp .ecggasp crp leu Z Z ,^e, . 

CAC ATC ACA AAC ACA AGC GAT GTC GAT TCT GAT AAT ATC ATC TCA ACA AAC TTO TTC CAC 
n.s ..e ^ asn c.r ar. asp val asp cys asp asn ile mec ser chr ^n 11^ 

TK AAG GAC AAG AAC ACT rrc ATC TAT TCA OGT CCT GAG CCA CTC AAG GCC ATC TCT AAA 
cys lys asp .ys asn c.r phe Ue cyr ser arg pro glu pro val ^ tl^ ^1 

961 / jr. g,^ f^" «er asp cys asn 

CTA ACA ACC AGG CCT TCC AAG TAT AAA TTA iar &A& » 
val Chr ser arg pro cys ly s cy r ^v^ IIu *P '^'^ 

1021 / 341 ' ^ '^"^ Jy'^y" f«f "^hr asn lys phe cys vai chr 

TCT GAA AAT CAC GCA CCA GTT CAT TTT GTT GGA err 03A TCT TCT 
cys glu asn gin aia pro val his phe val giy val gly ser cys 



SEQ ID NOS:40,41 



BNSOOCID: <WO 9731 1 16A2J_> 



wo 97/31116 PCT/US97/02588 
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DNA sequence 1137 b.p. ATCCXTTCTOG^A ... ACACnrrcCTCA linear 
1/1^ I 31/11 

ATC GCT CTG GAA AAA TCT CTC ATC CTC Tit: CCC C?IG TTC TTC CIG CTC CTC CXTT TCG GTT 
met gly leu gau lys ser leu lie leu phe pro leu phe phe leu leu leu gly cxp val 
61 / 21 tri y 5'^ rni^ 91 / 31 

CAG CCC TCT CTG OCrr CAG GAT TCG CTT ACA TIT GAG AAA AAA CAC ATC ACA AAC ACA AOG 
gin pro ser leu gly gin asp crp leu chr phe gin lys lys his lie thr asn thr aro 
121 / 41 ISl / 51 

GAT GTT GAT T5T GAT AAT ATC ATO TCA ACA AAC TIG TTC CAC TK AAO GAC AA^ ACT 
asp val asp cys asp asn ile met: ser thr asn leu phe his cys lys asp lys asn thr 
181 / 61 211 / 71 

TTT ATC TAT TCAan-CCTGAGCCAGTSAAGGCC ATC TGTAAACGA ATT ATA GCCTCCA^ ^' 
ph lie cyr ser arg pro glu pro val lys ala ile cys lys gly il© n© ala ser Ivs ^ 
241 / 81 271 / 91 

AAT GTO TTA ACT ACC TCT GAG TTT TAT CTC TCT GAT TCSC AAT GTA AC\ ACC AGG CCT 
asn val leu thr thr ser glu phe cyr leu ser asp cys asn val thr ser ara pro cys 
301 / 101 331 / 111 

AAG TAT AAA TI7V AAG AAA TCA ACT AAT AAA TTT TCT GTA ACT TGT GAA AAT CAG GCA Ca 
lys cyr lys leu lys lys ser thr asn lys phe cys val thr cys glu asn gin ala pro 

/ ^21 ;rH~-j5a9i / 131 r& 

GTT CAT TTTGrrGGAGTrGGATCT TOT*GCCAAG AAACTCAACGACGCTCAGGCGCCCAAG 
val h^ p he val gly val gly ser cys ala lys lys leu asn asp ala gin ala pro lys 
421 "^144 -r) <f-u 451 / 151 

AGT gat GAC ATC AAG ATG ACC CAG TCT CCA TCT TCC ATC TAT GCA TCT CTA GGA GAG AGA 
!«f ^^"^ "^^^ ieu gly glu ary 

lii iii 511/171 

GTC ACT TTC ACT TCC AAG GCG AGT CAG GAC ATT AAT AAC TAT TIA TCC TCG TTC CAG 
III T ?«f ^^"^ Ph« cm gin 

AAA CCA GGG AAA TCTT CCT AAG ACC CTC ATC TAT OCTT GCA AAC AGA Cro GTA GAT GOG TO 

T ?m ''^^ '''''' *^ 9ly vil 

601 / 201 531 / 211 

CCA TCA AGG TTC AGT GGC AGT GGA TCT GGA CAA GAT TAT TCT CTC ACC ATT AGC ACC CTC 
^/f^^ f^? ""^^ ^^'^ «P «^ ser ser leu 

GAG TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA CAG TAT GAT GAG TIT CCO TAC ACG TTC 
«lu cyr g u a.p gly iie cyr cyr cys l«u gin cyr a«p giu phe pre cjr S S 

/ ^ J. / * -^^j^ ^ 251 

C5GAGOTGGGACCAAGCTCGAAATAAAAGGAGGCOOTGGCTCGGGCOCrGGCGGATCGt5frr 

gay gly gly lys leu glu Ue lys gly ,ly gly gly ser 5^ ^ ^ ^ IS ^ ^ 

GGC go: GGC rrr GAG err CAG CTC CAG OkG Ict CTA Cro GCA AGG CCr QGC GC^ t <1 

|ly giy |ly ser glu val gin leu gin gin ser gly chr val leu ^ ^ p" ^5 SI 

TCA GTO AAG ATC ZCC TCC AAG GCT TCT GGC ^Ic A^C TO TCC AGC TAC TCG ATC CAC TOG 
ser val ly. n.e. ser cys lys ala ser gly cyr chr phe ser ser l^r^Tr.'^^.^ 

ATA AAA CAG AGG CCT GGA CAG GGT CTC GAC TO ATT GTC GCT ATT GAT CCT CCA AAT A£?r 
ae .ys g.n ar^ pro gly gin gly leu asp crp Ue val ala ile ^ SI ^ ^ 

GAT ACT ATT TAC AAC CCC CAA TTC AAA CAC AAG GCC AAA CTC ACT GCA CTTC ACC TCC ACC 

AWiA / ^«J. 1051 / 351 

AGC ACT GCC TAC ATC GAA CTC AAC AGC Cro ACA AAT GAG GAC TCT GCG CTC TAT TAC TCT 

lo^iT ""^ ^ S SI Sr ST 

ACC <^ flT TAT TAC rrr GAC TCC TC3G GGC CAA GGC ACC ACT CTC ACA CTC TCC S 
Chr pro leu cyr cyr phe asp ser crp gly gin gly Chr ^ IS IS 



SEQ ID NOS:42,43 



BNSDOCID: <W0 9731 1 16A2J_> 
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<1 / 

«r Mff 

m / 

coot OCT 

m / 
jur Tcr 
Clur Mr 
Ul / 

Ml / 

ooK err 

Ml / 
AA3 ASa 

421 / 
TQC AAO 

cy« lya 
4»i / 

TC/ OCT 

Ml / 
Mr OQC 
woiy 
SOI / 
A30 QOX 
Mt 9ly 
Ml / 

AAO ens 
lys i«u 

721 / 
GW-CTT 

711 / 

Mr eyt 
Ml / 
CC7 GBA 
pro gly 
fOV / 
AAC COS 
Ml pro 
Ml / 
A3C OAA 
MC Olu 
1021 / 
TAC TIT 

eyr ptia 



. 1 
CAT 



ai 

ATC 
Utt 

41 
CM 
0lu 

<1 
OAQ 
glu 

•1 

TCA 

Mr 

101 

OCX 

oly 

131 

ACC 
thr 
141 

occ 
«1« 

161 
AAC 

Jy« 

IBl 
ACT 
•mx 

201 
ATT 
il« 
221 
CAA 
Clu 
241 

gin 

24: 

AAC 

iy« 

281 

CAC 

gin 

301 
CAA 
cla 
22; 
CTC 
l«u 
241 
OAC 
MP 



1074 b*p. 
H4St Cu^ ^MA, /I 9> 

TQQ err ACA trr CM AAA AAA 
czp l«u ehr pbm ola iyo lys 

A» TCA ACA AAC no ttC CM 
Mc MT dur Ma 1m pbo hlo 



OCA Oro AM occ JOC nr AAA 
pro vml Xym aIa il« cy« lya 

TTT Wr CTC ICr QKT TQC AAT 
ptoo tyr lou mt map eya mm 

ACT AAT AAA mfoafoOi ACT 
car Mn lys ptefjiJgfTal ttar 

TCr OCCAM AAA exa AM 
tor eys ala lya lya lau an 

CM TCT CCA TCT TOC ATO TAT 
gin Mr pro sor Mr mc cyr 

ACT CAC CM ATT AAT AAC TAT 
Mr gla Mp 11a aan aan cyr 

ACC CTC ATC TXT OCrT OCA AM 

Uu- ivu ii« cyr arg ala am 

OOA TCT OOA CAA QKt TXT tCT 
9ly Mr gly gin aap cyr aar 

TAT TAT TCT CtA CM TMP GAT 
cyr cyr cy* lau gin cyr aap 

ATA AAA GQA GGC OCT QtSC TOO 
il« iye gly gly gly gjy sar 

CTC CAC CM-TCT OM ACr OTtA 
gir. gin aar gly ehr val 

CCr TCT CCC TAC ACC TTT TOC 
«!• s«r gly tyr chr pho ser 

CCr CX CAC TOC ATT CTC OCT 
gly i«u asp crp il9 vai «i« 

rrc AAA CM AM CCC AAA cm 

Ph« iys his lya al« lys i«u 

AAC ACC CTC ACA AAT CM OAC 
«an mmr l«ti chr asn glu Mp 

TCC TOC OCC CAA OOC AOC ACT 
Mr trp gly gin gly thr Uir 



21 / 
CM JSC 
hla Ua 
tl / 
TQC AM 

w Jyo 

m / 
OBA xrr 
«iy Ua 
211 / 
OXA ACA 
vml thr 
271 / 
lOr QAA 
eya ala 
Ul / 
QM OCT 
asp ala 
391 / 
OCA TCT 
ala aar 
491 / 

TCC 
IM cya 
811 / 
ASA CIO 

arg 1m 

fTl / 
CXC AOC 
IM Chr 
Ol / 
GM TTT 
glu gha 
Ml / 
OOC QQT 

aiy gly 

781 / 
etc OCA' 
iM ala 
811 / 
ACC TM 
aar cyr 
871 / 
ATT OAT 
iia asp 
931 / 
ACT OCA 
thr ala 
991 / 
TCT 006 
Mr ala 
1081 / 
CTC ACA 
lau thr 



11 

ACAMCACAAOiaATOrTOKrtar 
aaa thr arg aap val aap cya 

OMAMAMACrTrrASCtATtCA 
aap lya aoA thr pha 11a ^ aar 
M 

ASA GQC TQC AAA AAT 0X0 TZA ACT 
Uaalaaarlyaaaa^ml lauthor 

n 

AfiCAGQCCTTQCAMtKrAAAtm 
Mr arg pro cya lys tyr lya lau 
91 

AArCMGCAOCAGXTCXTTTrOXr 
aaa Bin ala pre Ml hla pha Ml 
Ul Fa -4~l~?r£4f#^ 
CM OQO 000 AM ACT CAT QM KK 
gin ala pro lya aar aap aap Ua 
Ul 

CTA GCaA GM AOA GTC ACT TTC ACT 
IM Oly glu arg val Chr pba thr 
191 

TOC TTC CM CM AAA OCA OBO AAA 
crp pha oln gin lya pro gly lys 

9XK OAT OQO GIC OCA TCA AQO TTC 
val aap gly vml pro aar arg plis 
191 

AR AOC AOC CXO QM TAT QAA GAT 
Ua aar aar lau olu cyr glu aap 
211 

CCD TAC AOS TIC GOA 000 000 AOC 
pre cyr thr pha gly gly gly ehr 
231 

OOC OCA T03 OCT OOC OSC OOC TCT 

gly gly sar gly gly gly gly aar 

2Sl ^ 

AGO CCT CCO^CCT tCA CTO AM ATC 
Arg pro gly aIs sar val lya mc 
271 

TOO ATQ CM TOO ATA AAA CM AOS 
crp MC his trp lla lys gin arg 
291 

CCT OOA AAT ACT GAT ACT ATT TM 
311 thr Ua cyr 

QIC ACC TOC AOC AOC ACT OCC TM 
val thr sar thr sar thr ala cyr 
331 

GTXC TXT TM TCT AOC CCT CTT TWT 
vsl cyr cyr cys thr pro 1m cyr 
391 '^ffc-^ 
ore TOC TCA CAT CM CAT 
val sar tar hla hla hla 



iir 4i>r»^ /4 ; 



ri5 



SEQ ID NOS:44»45 



BNSOOCID: <WO 9731 1 1 6A2_I_> 



wo 97/31116 
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AnrawosrTaimMiiincMiuuiJMCKCJacAaiA^ 

MP tdtsk U« Mt nr thr am Xw ptao Us «Qr> l3r« MP Sy« ttar pl» U« ^ mt 

Ul / 41 ISl / tl 

osrociFCMGCAoraMkgaoejmnrjMQQAiCTAaoeToem 

ato pro olu pee val lyo ala Ua q^o lyo aiy U« ttiA aar lya aaa ml lau thr 

iWl / «1 ni / Tl 

Acc tcr GAO Tfet cic tev oner toe amt ach juesc ago oct toe am mr aaa tta 
thr oor oJUi ^ orr ioa aor oop eyo qod val thr aar org pco eyo lys cyr tyo laa 
341 / 81 371 / n 

AA9 AAA tCA ACP AAT AAA ITT GST GTT CCT VQT <BUl AKP QEtC nA GCT QIC CAC TTO ORT 
lyo iyo eor cbr oon lyo pbo ml val ola eyo ola asn lou pco «U Mo leu agp 
301 / 101 331 / m ^ 

GftO TOV AST TIC CST OCT OOQ cCC ARO AAA C3Q AAC CAC GCT CAS GCS COB AA0 AST CBT 

OlQ OCT ilo 9to oin aro pro olo lyo ^ loa oca oop ola oln ola pro lyo oor oop 

' cic r«ic AAO /oo Aoc cng tcr cc\ tcf toc isvr gov tct tm cga crb aqsv est act 
n%",7 tlo ivii mac thr oln oor o£xi oor ooi; vex, tyr olo oor leu oly Slu oro thr 
4:^1 / 141 401 / 181 

tit: act tcc aac cotj act cao cf.c ATT Ara' mc wr rm tcc too to cm crq awi oca 
pho ctir eyo lytf ttXa ser Qln ft«p Uo acn tosn cyr l«u cy;i ctp Dtu^ ola oio ^ pro 
481 / 161 Sll / 171 

090 AAA TCT CCT AAA ACC CtO ATC TAT COT CCX KMC ACA CT9 GTA OAT GOO GTC CCA TCA 

Oiy lytt Mr pro lya thr lau 11a tyr arg ala asn aro lou val aap gly val pre aar 
S41 / 131 971 / m 

AOQ TK AOT QCC AGT GOA TCT QOA CAA GAT TAT TCT CXC ACC KPt AflC AllC CT9 OAO TKT 
arg pho oar gly aor oly aar oly ala asp cyr ear lau thr 11a aar sar lou olu tyr 
401 / 101 631 / au 

OAA GKT ATO GOA PCTV TAT TAT T9t CXA CAO TAT CSVT GAff VTT OOO tAC Afil TTC GOA OOO 

ala asp aot gly ilo cyr cyr eyo lau gla tyr asp glu pha pro cyr thr pho gly gly 
Ml / 131 391 / 331 

GCBO ACC AAO CTO QAA ATA AAA QQA GCC GOT GGC TOO QQC GOT OSC OQA TOO GOT GGC QOC 

gly thr lys lou glu Ilo lyo gly gly gly gly aor gly gly gly gly aar gly gly gty 
131 / 141 7S1 / 3S1 

GGC tCT OAO Q7T CAO CTC CAO CAO TCT OCSO ACT OTA CXO GCA AGO CCT GOO OCT TCA OTO 
Oly aor glu vol gin lou gin gla aor gly thr val lau alo arg pro gly alo cor val 
19: / Ul Qli / 171 

AAGATS-nrCTCCAASCCTTCTOOCTACAOCTTTTCCAOCTACTQaATaaVCTQaATAAAA 
lyo mot «or eyo lyo ala oar gly tyr thr pho aor aar tyr trp *ot hlo trp ilo lyo 
941 / 201 871 / 391 

CAO AGO CCT OQA CM 007 CT3 OAC TOO ATT GTC OCT ATT GMT CCT OOA AAT AOT OAT ACT 
gin arg pro giy gin gly lou oap trp Ilo val ala il« oop pro azg aon mt oap thr 
•01 / SOI 931 / 311 

ATTTACAACCCQCAATlCAAACACAftBGCCAAACXOACTGCAGTCACCTOCAOeAGCACr 
ilo tyr oon pro gin pho lyo hlo lyo ala lyo lou thr ala val thr aar thr aar thr 
tSl / 331 991 / 331 

OCCTMATOGAACXCAACMQCCTOACA AKT OAO GAC TCT GOO OXC TKT TAC TOT ACC CCT 
aU tyr mt glu lau aan oar lau tar aaa glu aop aar aU «al cyr tyr.eya thr pta 
loai / 141 xosi / 331 ^ 

crrTKrTACTrrGMtccToaoaecAAQocAocACTercACAavttocTCAGKrcAccKr 
lau cyr cyr pha aop oar trp gly gla gly thr thr lau thr val aor oar hlo hlo hU 

1091 / Ml 
TAaTPO 



SBQ ID NOS:46,47 



BNSDOCID: <WO 9731 1 1 6A2J_> 
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DTft ^s equence 1065 b.p. GACXTCAPJSKTG ... CITO 



MC ATC AAG ATS ACC CAG TCT CCA TCT TCC ATC TAT GCA TCT CTA GC» GtfU: AGA GTC 

tr tS- 

TTOACTTarAAOGTOAGTC^CSACATrAATA^ 
m T ^1 " ^ '"^ t« ^" P»»« «1« Sn ly^ 

TOAAATCTCCTAACACCCTOATCTATOTGCAAACAGAC^ 

gly lys «er pro ly. thr leu He cyr ar, aja asn arg leu val asp Sy S 

AOG TIC ACT GGC ACT GGA TCT G» CAA GAT TXT TCT ^nn «^ .„ /ll / ^ 

ar. pne ser aly gly .er .ly Si S ?S tll ^ ' 

GAA (»T ATtS Q» ATT TAT TAT TCT CTA OVC TAT GAT <»G TTT CCG T*C ACG TO 
,lu asp ,ly xle cyr cyr cy. i«, «lu phe S S 

GOG ACC AAG CTG GAA ATA AAA GGA QOC OCT GGC TCG GGC GCTT Mr nn* w 

fir r i^J "•• "V «y «y S S5 S S S S S 55 s s 
S S ST 52 SS IS » S S i S ."^ S T T ™ "= 

421 / 141 ^ « \jAy cnr vai iQu ala arg pro giy ala ser val 

?s 5^ ^ ^ SI ^ ^ ™^ ^ ^-^ ^ 

481/161 fj* **f '^'T ««c his crp ile lys 

541 / 181 H *P vai aia Ue asp pro arg asn ser asp chr 

ATT TAC AAC CCG CAA TVC AAA CAr kat* rv^ m.^^ jJ ^^ 

ST r;? :^ ^ s ^ i s ""-^ ^ 

661 / 221 f^" "f ^1 cyr tyr cys thr pro 

CTT TAT TAC TIT GAC TCC TCX5 CCC CAK GOT ^rr Jr-r 7^ ^\ * 

leu cyr .y. p^,e asp ser ^ chr ^ STf^ '^'^ ^ 

-CTC-AAC GAC-GCT- CAG- GCG-CCG-AAG ~ACT "^ A^ Tr-tT^li-'^'^Jii-'^'^ '■^^ 7 

leu asn aso ala gin ala pro lys ser a^I ^ 3^ ^ ^ AAA 

781 / 261 ^ T?^ «P leu Chr phe gin lys lys 

CAC ATC ACA AAC ACA AGG GAT GTT GAT TCT GAT aat liX — 

his ile chr asn chr arg aso val 3^1 ^A ACA AAC TIXJ TTC CAC 

841 / 26: ' ^ '^P ""^c ser Chr asn leu phe his 

TGC AAG GAC AAC AAC ACT TTT ATC TAT TCA rr4 i ~, »V1 ^A. ,(L-^ 

cys lys asp iys asn Chr pne il^ IJI 12 SI ^ ^ =^ '^■^ AAA 

901 / 301 "^^'^ ^^"^ P"^* 9lu pro val lys ala ile cys lys 

S U : a"?: sl^ :^I ^ ]^ ^ ^ ^« TAT Cn: TX^ GAT TCC AAT 

961/321 '''''^ phe cyr leu ser asp cys asn 
GTA ACA AGC AGG CCT TXIC AAG TAT AAA TTA AAr a»» 

val thr ser arg pro cys lys Cyl ^ Hi r ^ '^'^ t** ^ ^^A ACT 

TCT GAA AAT CAG CCA CCA err CAT TTT trrr criS4n.'^. 

cys glu asn gin ala pro val h J SI ^ ^ ^ ^ 



SEQ ID NOS:48,49 



BNSDOCID: <WO. 



) 9731116A2J_> 



wo 97/31116 PCT/US97/02588 
Nerr-FB-E6 -> l-phafi« Translation 

tKA sequence 1074 b.p, ATOGAOGATTOG ... TCACATCAOCAT linear 
1 / 1 V? 31/11 

ATC GAG GAT TCC CTT ACA TET GAG AAA AAA CAC ATC ACA AAC ACA AOG GAT GTT GAT T3T 
mec/glulafip trp leu chr phe gin lys lys his lie thr asn chr ary asp val asp cys 
61 i*ZCl 21 91 / 31 

GAT AAT ATC ATG TCA ACA AAC TIG TIC CAC TCC AAG GAC AAG AAC ACT TTT ATC TAT TCA J 
asp asn ile coec ser chr asn leu phe his cys lys asp lys asn thr phe lie cyr ser 
121 / 41 151 / 51 

OCT CTT GAC CCA GTO AAC OCC ATC TOT AAA CGA ATT ATA GCC TCC AAA AAT GTC TTA ACT 
arg pro glu pro val lys ala ile cys lys gly ile ile ala ser lys asn val leu thr 

.181 / 61 211 / 71 

ACC TCT GAG TTT TAT CTC TCT GAT TQC AAT GTA ACA AGC AGG CCT TOC AAG TAT AAA TTA 
Chr ser glu phe cyr leu ser asp cys asn val chr ser arg pro cys lys cyr lys leu 
241 / 81 271 / 91 

AAG AAA TCA ACT AAT AAA TIT TCT GTA ACT TCT GAA AAT CAC CCA CCA GTT CAT TTT GTT 
lys lys ser chr asn lys phe cys val chr cys glu asn gin ala pro val his phe val 

301 / 101 — •> f=*£5 331 / 111 ^ -\ — -->6(cs»r->> 

CGA GTT OGA TCT TCT GCC AAG AAA CTC AAC GAC GCT CAG GCG CCG AAG ACT GAT'gAC ATC 
gly val gly ser cys ala lys lys leu asn asp ala gin ala pro lys ser asp asp ile 
361 / 121 391 / 131 

AAC ATG ACC CAG TCT CCA TCT TCC ATC TAT GCA TCT CTA OGA GAG AGA GTC ACT TIC ACT 
lys met chr gin ser pro ser ser mec cyr ala ser leu gly glu arg val chr phe thr 
421 / 141 451 / ISl 

TGC AAG GCC ACT CAG GAC ATT AAT AAC TAT TTA TCC TGG TIC CAG CAG AAA CCA OQG AAA 
cys lys ala ser gin asp ile asn asn cyr leu cys crp phe gin gin lys pro gly lys 
481 / 161 511 / 171 

TCT CCT AAG ACC CTC ATC TAT CCT GCA AAC AGA CTC CTA GAT GGG GTC CCA TCA AGG TIC 
ser pro lys chr leu ile cyr arg ala asn arg leu val asp gly val pro ser arg phe 
541 / 181 571 / 191 

ACT GGC ACT GGA TCT GGA CAA GAT TAT TCT CTC ACC ATT ACC AGC CTC GAG TAT GAA GAT 
ser gly ser gly ser gly gin asp cyr ser leu chr ile ser ser leu glu cyr glu asp 
601 / 201 631 / 211 

ATC GGA ATT TAT TAT TCT CTA CAG TAT GAT GAG TTT CCG TAC ACG TIC GGA GGG QGC ACC 
met gly ile cyr cyr cys leu gin cyr asp glu phe pro cyr chr phe gly gly gly chr 
661 / 221 691 / 231 

AAG CTC GAA ATA AAA GGA GGC GCT GGC TCG GGC GCT QGC OGA TCG GCT GGC GGC OGC TCT 
lys leu glu ile lys gly gly gly gly ser gly gly gly gly ser gly gly gly gly ser 
721 / 241 751 / 251 

GAG GTT CAG CTC CAC CAG TCT GGG ACT CTA CTC GCA AGG CCT COG GCT TCA GTC AAG ATC 
glu val gin ieu gin gin ser gly chr val leu ala arg pro gly ala ser val lys roec 
781 / 26i 811 / 271 

TCC TGC AAG GCT TCT GGC TAC ACC TTT TCC AGC TAC TGG ATC CAC TOG ATA AAA CAG AOG 
ser cys lys ala ser gly tyr chr phe ser ser cyr crp mec his crp ile lys gin arg 
841 / 2B1 871 / 291 

CCT GGA CAG GCT CTC GAC TGG ATT GTC GCT ATT GAT CCT CGA AAT ACT GAT ACT ATT TAC 
pro gly gin giy ieu asp crp ile val ala ile asp pro arg asn ser asp Chr ile cyr 
901 / 301 931 / 311 

AAC CCG CAA TTC AAA CAC AAG GCC AAA CTC ACT GCA GTC ACC TCC ACC AGC ACT OCC TAC 
asn pro gin phe lys his lys ala lys leu chr ala val chr ser chr ser chr ala cvr 
961 / 321 991 / 331 

ATC GAA CTC AAC AGC CTC ACA AAT GAG GAC TCT GCG GTC TAT TAC TCT ACC CCT CTT TAT 
mec glu leu asn ser leu chr asn glu asp ser ala val cyr cyr cys chr pro leu cyr 
1021 / 341 1051 / 351 -t^J 

TAC TTT GAC TCC TGG GGC CAA GGC ACC ACT CTC ACA GTC TCC TCA CAT CAC CAT 
cyr phe asp ser crp gly gin gly chr chr leu thr val ser ser his his his 



SEQ ID NOS:50,51 



BNSDOCID: <WO 9731 1 16A2J_> 



wo 97/31116 PCTAJS97/02588 



1 -» X 



73 



jmtaaKrtoacrTAATTTCHOMUijyui 
■t&MrMpupX«uttarpto0lalyslya 

d. / n 

OKTMr nc Axo ten jMcns ne c»c 
321/41 

oarceroMsccKOioAMiaosjaentMu 
4n / n 

jmcraMTrrtKrcscTcarQxrtQCMr 
ttor nr aiu cyr X«a nr up 9a «aa 
241 / n r— ^ 
AM AM TO^ ACT MT AAA rtTCCrm ACT 
lys lyv «or thr aan ,lyo ptia bXah al uir 
901/101 v^^-l ' ^ 

Qtk can GBA Tcr vsrxacc aao aaa ens aac 
Oly vol oXy oor cys ola lya lys lea ma 
Ml / 121 

AMI CtQ CAO CAO TCX QQ31 OCT QAQ CIO AAO 
lya lou oXn gin aor gay pco giu lau lya 

ai / 141 

AAO CCT TCT GOO TC AOC TIC ACA AAC TC 
lya «la ser gly cyr cnr pba chr osn cyr 
4Bl / ici ^ 

AAO CCT TTA AAC TOG ATC OOC TC3 ATA AAC 
lya gly Isu lya czp msc gly up ilo asn 

OACltC AAO GGA GOO TIT CCC TTP TCT CIA 
aq» ah» ly« gly mzg pha ala pha aar lau 
ioi f aoi 

AaeAACAACCTCAAAAATaAOOACAaoaCr 
11a mm aan lau lya mma glu «ap chr ala 
MI / 221 » — fp «w •« 

QDOttCtAClQOGOCCAAGQOAOCAaaoiC 
w MP cyr trp gty gin gly chr thr ^i 
ni / 341 

OBA oor OOC TCT OOC OETT OOC GOA TOO OAC 

Til '/^ ail ^ 

OCA «_ACT CTTOtu ACA TCA CCT TOC i«C 
^ 52f Acu gly Chr aar ala aar ilo 
a^i / 3S1 

AOT AAT OaC ATC ACT TAT TIC TAT TOO TAT 

^ SiT *~ 
fStll^^^^^^^^cTraocTCA 

Ml / ^ "wt aan lau ala aor 

iSl ?^ ^ ••P thr lau arg lla aar 
WCTWCCTCAAAAfTCtAOAAATrOCTOOO 

SiT Si? P» W 

AAA 000 000 OOC OCA 
lya atg au 



21 / 
CACMC 
hia Ua 
fl / 



^ lya 

m / 

CBA AR 

Oly Ua 

au / 



u 

ACA AAC 
thr w thr 
21 

OAC MO AAC 
MA 



S S 2f S SI CO, c /V f -a 



291 / 
AAO OCT 
lyg pro 
481 / 
GOA m 
Oly aac 
811 / 
ACC TTIC 

Chr cyr 

sn / 

OAA AOC 
glu thr 
Ml / 
ACA «r 
thr ^r 
Ml / 

AOC ore 
Chr vfti 

W / 

ATT mo 

Ue ml 
•II / 

-TOC ICC 
aar oya 
871 / 
CTQ CAO 
leu gin 
921 / 
OQA die 

gly vol 
Ml / 
AOA on 
arg «ai 
1081 / 
AGO TIC 

Chr pha 



aar arg pco 
tl 

AAT GAO OCA 
•an Ola ala 

m 

CAO 600 000 
Oln alA pre 
121 

GOA GAO ACA 
Oly Olu chr 
181 

AAC TGO GIG 

aan txp ^al 
171 

ACT CCA QAO 

thr gly glu 
191 

TCr OOC ASC 
aar ala aar 
211 

TtC ICr QSA 
pba cya ala 
221 

toe ICA GOT 
aar aar gly 
281 

eZA AOC CAO 
lau Chr gin 
271 

AGO TCP ACT 
arg aar chr 
291 

AAO OCA CCC 
lya pro gly 
211 

OCA OAC AGO 

pro aap nrg 
221 

GAG OR OAO 
Olu *1« olu 
281 

GOT GOA OOC 

Oly gly oly 



AOr TIT AOC W TCA 
thr pha iia qrr aar 

AAA AAT OTO TTA ACT 
lya aaa aal lau ttor 

toe AAG TAT AAA im 
qfs lya cyr lya lau 

OQA OTT CRT TXT OTT 
pre vai hia pha i«i 

AAO ACT gat' CAO OlO 

lyo oor agp ^ 

GSC AAO AIC TOC IOC 
vol lyo 11a aer w> 

AAO ObO GOT OCA OBA 
lya Ola ala pro gly 

TCA ACA ITIT OCT GKT 

aar chr cyr ala aap 



OCT CCC TAT TXO GAO 
ala ala 1^ lau gin 

AOA TTC OCT AST AM 
asg pha ala Ua lya 

GOA GBC OCT TCA GOC 
Oly gly gly aar gly 

TCr OCA TIC TOC AAT 
aar pro pho aar aan 



AAO AiBT eic CBA arr 
lya aer lau lau hlo 

CAO TCT CCT CAO eiC 
gin OCT pro gin lau 

TIC AST ACC ACT GOO 
pha aer aor ear gly 

OKT on GOT onr w 

aap vol gly «al «yr 

AOC AAO C1Q OAA JOC 
chr lya leu glu 11a 



SBQ m NOS:52 J3 



BNSOOCID: <WO 9731 1 1 6A2J_> 



wo 97/31116 



PCT/US97/025«8 



MDCSlFBoMCsexOnc l*phastt Translation 74 

Dm sequ nee 1098 b.p. GGTCAGGTGAAG ... CTTCGATCTTGT linear 

1/1 31/11 

OCT CAG GTC AAC3 CTC CAG CAG TCA 0C3A CCT GM2 CTC AAG AAG CCT GGA GAG ACA CTTC AAC 
gly gin val lys leu gin gin ser gly pro glu l«u lys lys pro gly glu thr val lys 
61 / 21 91 / 31 

ATC TOC TCC AAG GCT TCT OOG TAC AOC TTC ACA AAC TAT GGA ATG AAC TOG GTG AAG CAG 
ile 8«r cys lys ala sar gly tyr chr pha chr asn cyr gly mac am crp vaX lys gin 
121 / 41 151 / 51 

GCT CCA OGA AAG QGT TTA AAG TOG ATC OQC TOG AXA AAC ACC TAC ACT QGA GAG TCA ACA 
ala pro gly lys gly leu lys trp mat gly crp ile asn thr cyr chr gly glu ser thr 
IBl / 61 211 / 71 

TAT GCT GAT GAC TTC AAC GGA CGG TIT CCC TTT TCT CTA GAA ACC TCT GCC AGC OCT GOC 
cyr ala asp asp pti^ lys gly arg pha ala phe ser leu glu thr ser ala ser ala ala 

241 / 81 271 / 91 Mc<^'^f'*^^ 

TAT TIC CAG ATC AAC AAC CTC AAA AAT GAG GAC ACC GCT ACA TAT TTC TCTT GCA ASA TTC 
cyr leu gin ile asn asn leu lys asn glu asp chr ala thr cyr phe cys ala arg phe 

301 / 101 331 / 111 

GCT ATT AAC GGG GAC TAC TOC QGC CAA GOG ACC ACG GTC ACC GTC TOC TCA OCT OGA OOC 
ala ile lys gly asp cyr trp gly gin gly thr thr val thr val ser ser gly gly gly 
361 / 121 391 / 131 

QGT TCA GGC GGA OCT GGC TCT QGC OCT GCC OGA TCG GAC ATT GTC CIA ACC CAG TCT CCA 
gly ser gly gly gly gly ser gly gly gly gly ser asp ile val leu thr gin ser pro 
421 / 141 451 / 151 

TTC TCC AAT CCA GTC ACT CTT GGA ACA TCA GCT TCC ATC TCC TOC AOS TCT ACT AAG ACT 
phe ser asn pro val chr leu gly thr ser ala ser ile ser cys arg ser thr lys ser 
481 / 161 511 / 171 

CTC CTA CAT ACT AAT GGC ATC ACT TAT TTC TAT TOG TAT CTC CAG AAG CCA GGC CAG TCT 
leu leu his ser asn gly ile thr cyr leu tyr trp tyr leu gin lys pro gly gin ser 
541 / 181 571 / 191 

CCT CAC CTTC CTC ATT TAT CAG ATC TCC AAC CTT GCC TCA OGA GTC CCA GAC ACG TTC ACT 

pro gin leu leu ile tyr gin met ser asn leu ala ser gly val pro asp arg phe ser 
601 / 201 631 / 211 

AGC ACT GOG TCA GGA ACT GAT TTC ACA CTC AGA ATC AGC AGA GTC GAG OCT GAG GAT GTC 
ser ser gly ser gly thr asp phe thr leu arg ile ser arg val glu ala glu asp val 
661 / 221 691 / 231 

GCT GTT TAT TAC TCT GCT CAA AAT CTA GAA ATT CCT CGG ACG TTC OCT GGA GGC ACC AAG 
gly vai cyr cyr cys ala gin asn leu glu ile pro arg thr phe gly gly gly thr Ivs 
721 / 241 ^1 -> i-ib 751 / 251 

CTC GAA ATC AAA CGG GCG »GCC GCA GCC AAG AAA CTC AAC GAC GCT CAG GCG CCC AAG ACT 
leu giu lie lys arg ala ala ala ala lys lys leu asn asp ala gin ala pro lys ser 

1 / / 261 »> >rte t si i-*- C: 811 / 271 

GAT ATC TCA GAT TCG CCT ACA TTT CAC AAA AAA CAC ATC ACA AAC ACA AGG GAT CTT GAT 

asp met ser asp crp ieu chr phe gin lys lys his ile chr asn chr arg asp val aso 

841 / 281 871 / 291 '-hp 

TCT GAT AAT ATC ATC TCA ACA AAC TTC TTC CAC TCC AAG GAC AAG AAC ACT TTT ATC TAT 

cys asp asn ile inet ser chr asn leu phe his cys lys asp lys asn chr phe ile cyr 

901 / 301 931 / 311 . 

TCA CCT CCT GAG CCA GTC AAG GCC ATC TCT AAA OGA ATT ATA GCC TCC AAA AAT CTTC TIA H'le'- 1 Or^ '^ 

ser arg pro glu pro val lys ala ile cys lys gly ile iie ala ser lys asn val leu 

961 / 321 991 / 

ACT ACC TCT GAC TTT TAT CTC TCT GAT TCC AAT CTA ACA AGC AGG CCT TCC AAG TAT AAA 
Chr chr ser glu phe tyr leu ser asp cys asn val chr ser arg pro cys lys cyr lys 
1021 / 341 1051 / 351 

TTA AAG AAA TCA ACT AAT AAA TTT TCT CTA ACT TCT GAA AAT CAG CCA CCA GTT CAT TIT 
thr asn_lys phe cys val chr cys glu asn gin ala pro val his phe 

1051 / 3 SI V j 

GTT OGA CTT GGA TCT TCT 
val gly val gly ser cys 



SEQ ID NOS:54,55 



BNSDOCID: <WO 9731 1 16A2J_> 



_ W0 97/5U16 PCT/US97/02588 
MecsezoncF1BMDC161 -> Translati n 

DHA sequ nee 1065 b.p. ATCTTCMfiTTOG ... GAOCTCAAAOC3G linear 
I / I 31/11 

ATC TCA GAT TCG CTT ACA TIT CAG AAA AAA CAC ATC ACA AAC ACA AOG GAT GTT GAT TCT 
mec ser asp crp leu thr phe gin lys lyc his lie thr asn thr ary asp val asp cys 
61/ 21 91/31 

GAT AAT ATC ATS TCA ACA AAC TPG TTC CAC TOC AAC GAC AAC AAC ACT TIT ATC TAT TCA ^ 
asp asn lie mec ser chr asn leu phe his cys lys asp lys asn thr phe ile cyr ser 
121 / 41 151 / 51 

CCT CCT GAG CCA CTTO AAG CCC ATC TCTT AAA OGA ATT ATA GCC TCr AAA AAT <nG 

arg pro glu pro val lys ala ile cys lys gly ile ile ala ser lys asn val leu thr 

181 / €1 211 / 71 

AO: TCT GAG TIT TAT CTC TCT GAT TCC AAT GTA ACA ACC AOG CCT TO AAG TAT AAA 
Chr ser glu phe cyr leu ser asp cys asn val thr ser arg pro cys lys tyr lys leu 
241 / 81 271 / 91 

AAGAAATCAACTAATAAATrrTCTGTAACTTCTGAAAATCACGCACCAGTrCATTrT 
301 101 '^^^"/yy^^^° val thr cys glu asn gin ala pro val his phe ^ 

GGA GTT GGA TCtWUca^^ AAACTOAAcS^OCTCA^GCCCCGAACACT S^fel^ ' ' ' " 

?iT «p «P ^ 

CAA CAG CAG TCA GGA ACT GAG Cro ATA AGG CCr OCG ACT TCA 

gin leu gin gin ser gly thr glu leu ile arg pro gly thr ser val lys ile ser 

' 451 / 151 

AAC Orr TCT GGA TAC GCC TTC ACT Ca«: TAC TO3 CTA OCn- TO! CTTA AAA 

Bl '"'^ ^^'^ '^'T* lys arS So ?S 

481 / 151 5JJ / 171 *- V J 

CAT GGA CTT GAG TCG ATT GGA GAT ATT TAC CCT OCA ACT GAT AAT ACT TAC TAC AAT GJkC 
hxs gl. leu ,10 czp Ue ,ly „p ile cyr pro ,ly ser asp «n ^ S? «J ^ 

AAA TTC AAC GGC AAA GCC ACA CTC ACT ACA GAC A^A TTO AGC ACA GC^ 

lys Phe lys ,ly ly. ala chr leu .hr Chr a»p ly. «er ser .er cS ^ IJr 

CTC ACT AGC CTC ACA TCT GAG GAC TCT CCT GTC TAT TTC TOT GCA AGC GGC CTT A^ 

leu ser ser leu chr ser glu asp ser ala val rvr r^^,« ^.l , . 

661 / 221 ^* ^/ PJ* "'^ gly leu Xys gly 

GAC TAC TK GGC CAA GGC ACC ACC CnC ACC Crc TCC TXIA GGT GGA GGC OCT TCA OGC CCA H <>••• ' " 

asp .yr .r, ,.v .In .ly cHr chr val chr val ser ser ^ 5^ ^ 

--GG= £^ TTT- GGS-CXrr CWC-GGA--T^^ - " - 

%\ '7 111 ^^5^ 9ln IS IS ^ 

s^ S ^5 ^ chr t]^ c^!^ 
8^1 / :e-. " ■'^'^ ^^"^ ile lys lys ser 

ATA CCT TCC TAC CAA CAC AAC crT rj-» i»» 

;.e a.^ -ro -y ^ ^''^ CCT AGG CTC CTC ATT CAT TAC ACA TCT 

9o; / 301 ' fj^y P'-° ^'■9 ile his cyr chr ser 

^ S" IS '^^ ^ «^«A<=^ T^ 

96: / i2! arg phe ser gly ser gly ser gly glu glu cyr ser 

pn; t^". =tu ITo Si S^ ^ B ^ CAG tat gat 

AAT err COG ACC TTC CCT GGA GCC ACC AAC CTG GAG CTO AAaW 
asn leu arc .hr phe gly gly giy .^.^ lys leu ^ ^ ^ 



SEQ ID NOS:56,57 



BNSDOCID: <WO 9731 1 1 6A2J_> 



wo 97/31116 



PCTAJS97/02588 



rL2Fa<«ts«rOnc -> l-pha«e Translation 75 

DN^ sequ nee 753 b.p. GCACCTACTrCA ... GTItjaATCTTOT linear 

1 / 1 ' 31 / 11 

GCACCTACTTCAACTTCrACAAAGAAAAavau;crAaUC^ 

ala pro thr ser chr ser thr lye lys thr gin leu gin leu glu hie leu 1 u leu asp 
61 / 21 91 / 31 

TTA CMS ATC ATT rro AAT OGA ATT AAT AAT TAC AM AAT CCC AAA CIC ACC ACG ATC CIC 
leu gin met lie leu asn giy iie asn asn Cyr lyc asn pro lys leu thr arg met leu 
121 / 41 ISl / 51 

ACA TTT AAG TTT ™c ATC CCC AAG AAC GCC ACA GAA CTC AAA CAT OT 

thr phe lys phe cyr mec pro lys lys ala chr glu leu lys his leu gin cys leu alu 

■181 / 61 211 / 71 

GAAGAACTCAAAOCTCrcGACGAAGTCCTAAATTrACCTCAAAKTAAAAACTITC^ 
glu glu leu lys pro leu glu glu val leu asn leu ala gin ser lys asn phe his leu 
241 / 81 271 / 91 

AGA CCC AOG GAC TTA ATC AGC AAT ATC AAC CTA ATA <7IT CT3 GAA CIA AAC GCyV trr ^ 

Vr!i Oiu leu lys gly ser glu 

301 / 101 331 / 111 

^ ^ ^ ^ «=A ACC ATT <rCA GAA ^ 

thr thr phe met cys glu tyr ala asp glu chr ala thr die val glu phe leu asn arg 
^ 391 /-^frr/- — s> PA 

TGG ATT ACC TIT TCT CAA AGO ATC ATC TCA ACA Cro ACT^OCC AAC AAA CTO AAC GAC GCT 

\7i V^^U^ ^Jl "'^^ ^^p 

gin a a pro lys ser asp met ser asp crp leu thr phe gin lys ij^ Ss tl^ ^ 

AOV AGS <»T GTT GAT TXrr AAT ATC ATC TCA ACJV AAC TTO TTC CAC TCC AAG «^ 
Chr arg asp val asp cy. asp asn ile n.ec «r chr «n leu phe hi, cys ^ ^p 

tSi T "'^ ^y** ly« ile ala 

bui / ^01 ^31 / 211 

I^' «n ^ Af-^ W TAT CTC TCT GAT TCC AAT GTA ACA ACC AOG 

III / \Ti " ^ «n ^1 Chr ser arg 

CCT TCC AAG tat AAA TTA AAG AAA TCA act AAT AAA tit TCT GTA act TCT GAA AAT CAG 
pro cys lys .yr lys leu lys lys ser thr asn lys phe cys ^1 t^ 51 ^ 

GCA CCA CTT CAT TTT CTT GGA GTT GGA TCt'tCT ^ 
aia pro vai h.:s pne val cly val gly ser cys 



SEQ ID NOS:58,59 



BNSDOCID: <WO 9731 1 1 6A2_L> 



W052/31116 ^ _ _PCiyUS97/02588 



MecserOncFBIL2 -> i -phase Translation 



77 



CN^ sequence 768 b.p. ATOICAGATTOC ... CACOlTrAATAG linear 



^1/1 31/11 
ATC TCA Cy^T TO CTT ACA TTT CAC AAA AAA CAC AlC ACA AAC A» AOG «^ 

met ser asp trp leu thr phe gin lys lyc his ile thr asn thr ary asp val asp cvs >vi ^-t" Xlfis^,. 

61/21 91/31 

C»T AAT ATC ATC TCA ACA AAC TIG TTC CAC TO AAC CUU: AAG A^ 

asp asn ile nee ser thr asn leu phe his cys lys asp lys asn thr phe ile cyr ser 
121 / 41 X51 / 51 

CCT CCT C»C CCA Cro AAC GCC ATC TCTT AAA OCa^ ATT ATA CXX TO A^ 

arg pro glu pro val lys ala lie cys lys gly ile ile ala ser lys asn val leu 2hr 

_ ' ** 211 / 71 

ACC TCT CM -m T^T Cnc TCT «T TO A^^ 

thr «er glu phe cyr leu s.r «p cy« *.n val thr .er .rg pit, cys S 

ly. lys ser thr ._y, ph^^. v.l thr cy. «lu «1„ .1. pro ^ S^I SI S 

GGA CTT oca TCrr TGTf GCC AAOAAACTOAACGACQCTCACGCCCCCAJICAffr Gw'S^OT ^ 
!« "^x «P 111 pro ly. ^ ^ S 

«^ r ^ t*° t** "° ^ <^ «T TTA cro CTX3 C»T TTA 

thr ser thr ser chr ly, lys thr gin leu gin leu glu his leu leu leu 

ATC ATT TTC AAT OGA ATT AAT AAT TAC AAG il^T C^C iS'ciC ACC Affi ATC CTC ACA TTT 
met xle leu «n gly Ue asn asn tyr ly, asn pro lys leu thr .ry ^IIT ?S ?S SI 

AACTCTTACATCCXCAAGAACaXACAGAAOTALlWTCITCACTOrcrAG^ 

lys Phe tyr ^et pro lys lys al, thr glu leu lys his leu gin SI ^ ^ ^ 

CTC AAA CCT CTC GAG GAA GTC CTA AAT TTA GCT caa Arj- aaa a*.- 

™ ^ ^ ^ s 12 :s s " 
^ it: "i^ :s tis .'is ?™ s s °H r 

661 / 22! ^^"^ ser glu nhr chr 
TTC ATC TCT GAA TAT CCT CAT GAG ACA CCA ACC ATT rrl r-KA ^ 
S^^ J^. glu tyr ala asp S ^ ^JS t]I 

^^r "^^^ '^'^ TCA ACA CnTXc^tlT CAT TAA TAG 

Chr Phe cv= ser Ue ,le ser thr leu thr his hxs ^il ^ 



SEQ ID NOS:60,61 



BNSOOCIO: <WO. 



) 9731116A2J_> 
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DMA ««quence 387 b.p. ATO0C3TCTOGAA ... CTTIOSATCTTCT linear 

'l / 1 ^ * ' 31/11 

ATS GCT CTC GAA AAA TCT CTO ATC CTC TTC CCG CT3 TTC TTC CTC CIG CTC GCTT TOG GTT 
mac gly leu glu lys sar lau ila lau pha pro lau pha p^e leu lau leu gly crp val 
61/21 ^ cvtw 91 / 31 

CAG CCC TCT CTC OCT CAC GAT TOG CTT ACA TIT CAC AAA AAA CAC ATC ACA AAC ACA AQG 
gin pro aar lau gly gin asp crp lau chr p»ie gin lys lya his ila chr asn chr arg 
121 / 41 151 / 51 

GAT CTT GAT TOT GAT AAT ATC ATC TCA ACA AAC TIG TIC CAC TCC AAC GAC AAG AAC ACT 
asp val asp cys asp asn ila mac sar thr asn lau phe his cys lys asp lys asn chr 
IBl / 61 211/71 

TTT ATC TAT TCA OTT OCT GAG CCA CTC AAC CCC ATC TCT AAA OCA ATT ATA GCC TCC AAA 
phe ila cyr ser arg pro glu pro val lys ala ila cys ly« gly ile ile ala sar Ivs 
241 / 81 271 / 91 

AAT CTC TTA ACT ACC TCT GAG TIT TAT CTC TCT GAT TCC AAT GTA ACA ACC AGG CCT TCC 
asn val lau thr chr ser glu phe cyr lau sar asp cys asn val thr ser arg pro cys 
301 / 101 331 / xil 

AAG TAT AAA TTA AAG AAA TCA ACT AAT AAA TIT TOT GTA ACT TCT GAA AAT CAC OCA CCA 
lys cyr lys leu lys lys sar thr asn lys pha cys val thr cys ylu asn gin ala pro 

361 / 121 

GTT CAT TTT CTT GGA CTT QGA TCT TCT 
val hi£ phe val gly val gly ser cys 



SEQ ID NOS:62.63 
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WHAT IS CLAIMED IS : 

1 1. A ribonuclease molecule that has (a) measurctble 

2 ribonuclease activity; (b) an amino terminal end beginning with a 

3 methionine which is followed by amy amino acid other than glutamic acid; 

4 (c) a cysteine at amino acid positions 26, 40, 58, 84, 95 and 110; a 

5 lysine at position 41 and a histidine at position 119, such positions 

€ being determined with reference to those specified amino acid positions of 

7 bovine RNase A (SEQ ID NO: 13), and (d) an nOnc-derived amino acid 

8 sequence . 

1 2, The ribonuclease of claim 1 which has an amino terminal 

2 end selected from the group consisting of: Met-Lys; Met-Tyr; Met-Ser; Met* 

3 Ala; Met-Arg; and Met-Asn. 

1 3. The ribonuclease of claim l, which has an amino terminal 

2 end selected from the group consisting of; 

3 Met -Ala; 

4 Met -Ala-Ala; 

5 Met-Ala-Ala-Ser; 

6 Met-Arg; 

7 Met- (J); 

8 Met-Lys- (J) ; 

9 Met-Arg- <a); 

10 Met-Lys; 

11 Met-Lys-Pro; 

12 Met-Lys- (J)-- Pro (SEQ-ID -NO:-14)-; - 

13 Met-Lys -Pro- (J) (SEQ ID NO:15); 

14 Met-Asn; 

15 Met -Gin; 

16 Met-Asn- (J) ; 

17 Met-Gln- (J) ; 

19 Met-Asn- (J) -Pro (SEQ ID NO:16); 

19 Met- (J) -Lys; 

20 Met- (J) -Lys-Pro (SEQ ID NO:17); and 

21 Met- (J) -Pro-Lys (SEQ ID NO:18) ; 

22 where (J) is Ser, Tyr or Thr . 

1 4. The ribonuclease of claim 1, which has an amino terminal 

2 end of Met -Ala. 

1 5. The ribonuclease of claim l, which has an amino terminal 

2 end of Met-JVrg. 

1 6. The ribonuclease of claim l, which has an amino terminal 

2 end of Met-Lys. 
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1 7. The ribonucleaae of claim l, which has an amino terminal 

2 end of Met-Asn. 



1 8. The ribonuclease of claim l, which has an amino terminal 

2 end of Met -Gin. 

1 9. The ribonuclease of claim l, which has an amino terminal 

2 end selected from the group consisting of Met-Ser; Met-Tyr or Met-Thr. 

1 10. The ribonuclease of claim 3, wherein aspartic acid of 

2 amino acid position 2 of nonc (position 4 with reference to the sequence 



3 of bovine RNase) is 



3 



4 



7 
8 
9 
10 
11 
12 
13 
14 



1 

2 

1 
2 



deleted or replaced by Ala or Asn. 



1 11. The ribonuclease of claim l, comprising a molecule 

2 having an amino terminal end encoded by a sequence derived from the amino 



terminal end of EDN which is followed by a sequence from rOnc . 



1 12. The ribonuclease of claim li wherein the amino acid 

2 sequence is one selected from the group consisting of those sequences 



substantially identical to those of a formula: 

Met (-l)EDN^,^Onc,^,oo 
wherein Met(-l) refers to an amino terminal residue of Met; 
wherein EDN,,^ refers to a contiguous sequence of amino acids of a length 
beginning at amino acid position l of EDN (SEQ ID N0:9) and continuing to 
and including amino acid position "m" of EDN; wherein Onc,^,04> refers to a 
sequence of contiguous amino acids beginning at amino acid position "n" 
and continuing to and including amino acid position 104 as set out in SEO 
3 ID NO:l; such that: 



when m is 21, n is 16 or 17; 
^ when m is 22, n is 17, 

^ when m is 20, n is 16, 

when m is 19, n is 15, 
when m is 18, n is 14 
when m is 17, n is 12 or 13; 
when m is 16, n is 11, 12, 13 or 14; 
when m is 15, n is 10; 
when m is 14, n is 9; 
when m is 13, n is 8; and 
when m is 5, n is 1 . 



13. The ribonuclease of claim 1, comprising an amino acid 
sequence substantially identical to that of SEQ ID NO:28, 

14. The ribonuclease of claim i, comprising an amino acid 
sequence substantially identical to that of SEQ ID NO: 22. 
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15. The ribonucleaee of claim 1, comprising an amino acid 
sequence substantially identical to that of SEQ ID NO: 24. 

16. The ribonuclease of claim l, conprising an amino acid 
sequence substantially identical to that of SEQ ID NO: 26. 

17. The ribonuclease of claim l, comprising an amino acid 
sequence substantially identical to that of SEQ ID NO: 30. 

18. The ribonuclease of claim l, comprising an amino acid 
sequence substantially identical to that of SEQ ID NO: 32. 

19. The ribonuclease of claim i, which includes an amino 
acid sequence substantially identical to that of SEQ id nO:2. 



^ 20. The ribonuclease of claim l, comprising a carboxyl 

2 terminal end derived from angiogenin* 

1 21. The ribonuclease of claim 20, comprising an amino acid 

2 sequence substantially identical to that of SEQ ID NO: 20. 

1 22. An amino acid sequence substantially identical to that 

2 set out in SEQ ID NO; 2. 



23. The ribonuclease of claim 1 joined to a ligand binding 
moiety or a label . 



1 24. The ribonuclease molecule of claim 23, wherein the 

2 molecule is joined to an amtibody. 

1 25. A nucleic acid sequence encoding the amino acid sequence 

2 of claim 1. 

26. A pharmaceutical composition comprising a cytotoxic 

1 amount of a ribonuclease of claim i and a pharmaceutically acceptable 

2 caxrier. 



1 
2 



27. The pharmaceutical composition of claim 26 wherein the 
rxbonuclease is joined to a ligand binding moiety. 



1 28. A method of selectively killing cells comprising 

2 contacting cells to be hilled with a ribonuclease of claim l joined to a 
ligand bindina moietv 



3 ligand binding moiety. 

^ 29 . The 

2 nuclear localization signal. 



1 29. The ribonuclease molecule of claim i which further has a 
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1 30. The ribonucleaee molecule of claim l which further has 

2 an endoplasmic retention sequence. 

1 31. A vector comprising a nucleic acid encoding a 

2 ribonuclease of claim X . 

1 32. A host cell con^rising a nucleic acid encoding a 

2 ribonuclease of claim 1 . 
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' ' searchable claims. 

2. 1 1 As aJI searchable claims couJd be searched without effort jusufying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' ' covers only those claims for which fees were paid, specificaiiy claims Nos.; 

^- [ 1 No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 
resuicted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Remark on Protest [ [ The additional search fees were accompanied by the applicant's protest- 

I 1 No protest accompanied the payment of additional search fees. 
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